Advertorial

New report reveals major change in global enterprise innovation landscape

On October 21, Wuhan University and the Institute of Scientific & Technical Information
of China issued a comprehensive report on the current landscape of global enterprise
innovation. Titled “Global Enterprise Innovation Index 2023 (GEII 2023),” the report aims
to support enterprises and policymakers across economies in understanding the latest
trends of global innovation in science and technology.

Modern societies collectively expect new inventions, new gadgets, new services,
new productivity, and new industries—all in the name and notion of innovation.
Innovation is a cornerstone of today's economic growth as we know it. For
companies around the world, “innovation” has long evolved beyond a buzzword into
a deeply rooted philosophy for securing and sustaining business success.

Research on innovation and the impact and prediction thereof is nothing new.
However, previous reputable reports on global innovation, such as the Global
Innovation Index published annually by the World Intellectual Property Organization
or the Technology and Innovation Report issued by the United Nations Conference
on Trade and Development, largely focused on rankings of nations’ productivity
capacity linked to economic growth. Companies categorized as global enterprises
operating businesses in diverse cultures and economies are usually considered
a contributing factor to a country or region's innovation status. In a way, these
companies could be muffled by their home country’s representation through which
the enterprises’ innovation was primarily described as a part of their home country’s
R&D investment.

Evaluating and repositioning global enterprises’ leadership role in innovation
is timely, especially post-pandemic. Science and technology will rely on global
enterprises to reach a much broader population than ever. In turn, the global
market's demanding emerging industries will drive more fundamental research to
advance our understanding of science and realization of new technology.

To provide a comprehensive look at global enterprises’ innovation capacity, the
Global Enterprise Innovation Index 2023 (GEII 2023) identified three key groups of
innovation indicators—knowledge, technology, and collaboration—while subdividing
the three groups into nine second-level and 32 third-level indicators. This system
broke the existing innovation index format by moving the global enterprises to the
center of attention while emphasizing emerging industries, namely new energy,
biomedicine, advanced manufacturing, new materials, and next-generation 
information technology. By shuffling the priorities, the report showed inspiring 
results while still keeping the world’s three major economic regions—Asia, Western 
Europe, and North America—in context. For example, when looking at combined 
innovation capacity in all evaluated fields, Japan leads the race with the United 
States following right after. China has the second largest economy in the world, 
and has traditionally pivoted around manufacturing, exports, and investment. Now 
China is showing strong momentum in next-generation information technology. The 
report also indicated a significant gap Chinese global enterprises need to bridge in 
innovation capacity to develop at the same level as current enterprise leaders. 

“Our newly discovered three-pillar understanding of current world economic 
development driven by enterprise innovation disclosed a leadership position in 
global innovation for East Asian enterprises that traditionally was dominated by 
Western Europe and the United States,” write Ying Huang and Lin Zhang, chief
scientists of the report. 

Japan has 151 leading innovative global enterprises. The United States secured 
second place with 147 enterprises, shy of Japan by only four spots. Germany and 
China took the third and fourth places, with 45 and 41 enterprises, respectively. 

This report is just the beginning of a much-needed persistent effort to reflect 
current economic development. Dean of Wuhan University’s Big Data Institute and 
Director of the GEII 2023 expert committee Feicheng Ma says, “We hope to continue
adjusting and optimizing the index system, expanding selection of emerging 
industries, and systematically peeling back the layers of innovation driven by 
global enterprises.” 


Sponsored by 


Institute of Data Intelligence, Wuhan University


IMAGE: © JAMESTEOHART/SHUTTERSTOCK.COM 


PHOTO: DAVE BARNES/BAS 




Genetic population structure of the 
circum-Antarctic octopus Pareledone turqueti 
shows that the West Antarctic Ice Sheet 
collapsed completely during the Last Interglacial.


EDITORIAL


2024 looms 


Last week, Science reflected on major achievements
in science in 2023, from weight loss drugs
and a malaria vaccine to exascale computing and 
advances in artificial intelligence. These are all 
impressive developments and provide yet more 
testimony to the power of science to continually 
expand the quality of our lives while deepening 
our understanding of the world. Even so, it’s hard to 
end the year without some worries about 2024. Wars 
in Ukraine and Gaza will grind on in the new year, and 
the United States is headed toward perhaps the most 
consequential and divisive presidential election in more 
than 160 years. These events—and similar ones around 
the world—will challenge the cohesiveness and deter- 
mination of the scientific community 
as never before. 

The situation in Argentina could 
be a harbinger of the danger ahead. 
There, voters elected Javier Milei, “a 
bombastic, right-wing politician,” in
the wake of severe economic prob- 
lems in the country. Milei has vowed 
to shrink or eliminate the leading sci- 
ence agency in Argentina, which he 
has said is “unproductive,” despite 
wide international recognition of its 
importance. Similar cries are likely to 
emerge in the US election, where an 
independent-party candidate, Robert 
F. Kennedy Jr., has already vowed to 
suspend government-sponsored re- 
search on infectious diseases, and Republican primary 
candidate Vivek Ramaswamy is calling climate change 
a hoax. Neither has a prayer of winning, but they stand 
as a warning of how demagogues can mislead voters 
with their ignorance and exploit a growing sense of sus- 
picion of scientific expertise. 

In the US, confidence in science continues to decline 
and become more polarized. Only 57% of Americans 
now believe that science has had a positive effect on so- 
ciety, down 16 points since the start of the severe acute 
respiratory syndrome coronavirus 2 (SARS-CoV-2) out- 
break. This division is evident in a recent study show- 
ing that although conservatives and liberals have the 
same tendency to believe conspiracy theories, those be- 
liefs that involve science—such as denials of COVID-19, 
the efficacy and safety of vaccines, and human-caused 
climate change—skew far to the right in their level of 
acceptance. Even unfounded doubts about genetically
modified crops, which were once held mostly by those 
on the left, have moved to the center. With an eroding 
base of agreement on what counts as well-supported 
fact, it’s now even harder to stand up for science with- 
out appearing partisan. 

The solution to this is daunting but doable. The 
scientific community must figure out how to speak 
out against anti-science attitudes and beliefs with- 
out sounding condescending and furthering populist 
paranoia. It doesn’t help that public confidence in US 
higher education institutions, where most research is 
conducted, has plummeted to 38%. Last month, US 
ambassador to Japan Rahm Emanuel was on the mark 
when he told Sudip Parikh, chief executive officer of 
the American Association for the 
Advancement of Science (AAAS, the 
publisher of Science), that scientists 
need to start speaking with “outside 
voices.” “You can have all your meet- 
ings with each other,” he said, “share 
papers, do peer reviews, but still, 
what you do will be under attack.” 
As these surveys show, this is a prob- 
lem the scientific community has not 
cracked. It must look for new ways to 
reach the public and better support 
those who are directly engaging with 
the public and trying to broaden un- 
derstanding of science as humanity’s 
evolving attempt to discover how the 
world works and how a deep under- 
standing of nature can be harnessed to benefit every- 
one, regardless of political persuasion. 

Thus, the scientific community must get its own 
house in better order. It should pay attention to the 
morale and welfare of trainees and students, call out 
and address bias and discrimination in its processes, 
and ensure the production of a reliable scientific record 
through a correction mechanism that is expedient and 
transparent. Missteps in these realms make it easier for 
those who are always looking for the slightest excuse to 
amplify their attacks on science. 

We are truly living in two worlds—one of discovery 
and wonder and one of division. This means that scien- 
tific leadership must be simultaneously bold and pasto- 
ral. We all must stand up for science while also caring 
for the community that produces it. This is no easy task, 
but it’s never been more important. 

-H. Holden Thorp

H. Holden Thorp is Editor-in-Chief, Science journals. hthorp@aaas.org


Estimated share of bird species driven extinct by humans, mostly
on Pacific islands. The number of extinct species, 1500, is double
previous estimates. The new one relies on fossils of species that went
extinct before being formally recorded. (Nature Communications)


Amateur fossil hunter 
Steve Etches helped 
excavate the 2-meter- 
long fossilized skull of a 
carnivorous pliosaur. 


Near-perfect pliosaur skull hints at big bite 


A U.K. team that includes amateur fossil hunters is
studying the recently discovered skull of a pliosaur,
a large, carnivorous marine reptile that dominated
ancient oceans about 150 million years ago, news
media reported last week. The specimen was found
in 2022 embedded in a cliff along Dorset’s famed
Jurassic Coast, where the pioneering 19th century amateur
fossil hunter Mary Anning once discovered skeletons
of other marine reptiles. Scientists are calling the new
skull, about 2 meters long, one of the most complete of
its kind. After measuring it, paleontologist Emily Rayfield
of the University of Bristol calculated the creature's bite
force could have reached 33,000 newtons—about double
that of modern-day crocodiles. Other studies have estimated
that some pliosaurs’ entire bodies measured up to
12 meters long. The fossil has not yet been described in a
scholarly article or preprint but is to be featured in a BBC
documentary debuting on 1 January 2024.


HHMI wins disability lawsuit


LEGAL AFFAIRS | A Maryland jury last week
decided that the Howard Hughes Medical
Institute (HHMI) did not discriminate in
2018 when it failed to renew its generous
investigator award to Vivian Cheung,
an RNA biologist at the University of
Michigan who is disabled. In 2014, Cheung
learned she has a rare genetic disorder
that has impaired her immune system
and vision. When she asked the wealthy
research philanthropy for employment
accommodations, it began to discriminate
against her, she alleged in a lawsuit she
filed against HHMI in 2020. She said that
HHMI threatened to terminate her if she
didn’t return to her Michigan lab from
Maryland, where she had been receiving
medical care and doing research. HHMI
argued that it didn’t renew Cheung’s
multimillion-dollar grant because her science
wasn’t good enough. In 2018,
20 external reviewers gave Cheung 14 C’s
and six B-minuses. Advocates for disabled
scientists had followed the 8-day trial and
noted that disability discrimination claims
are particularly hard to prove in court.


Scientists help clear mother 


FORENSICS | An appeals court in Australia 
last week officially struck down the convic- 
tions of an Australian woman, Kathleen 
Folbigg, for killing her four young children 
decades ago. A jury found Folbigg guilty
in 2003 for the deaths despite a lack of
direct evidence. Since 2019, geneticists 
and other researchers have gathered and 
publicized data suggesting her children 
likely died from undiagnosed medical 
conditions, which helped Folbigg win a 
pardon and release from prison earlier this 
year. On 14 December, she was present in 
the courtroom as the appeals court called 
the original evidence used against her at 
trial “unreliable” and its chief justice spoke 
of the “substantial and extensive body” of 
new evidence the scientists had provided. 
The Australian Academy of Science, which 
acted as an independent adviser for an 
official inquiry into the conviction, said 
Australia should set a standard for the reli- 
ability of evidence, as other countries have. 


WHO targets disfiguring disease 


PUBLIC HEALTH | The World Health
Organization (WHO) last week added
noma, a disease that disfigures the face and
can be fatal if not treated, to its official list
of neglected tropical diseases. The designation
is intended to attract more research
and funding for curbing the disease, also
known as cancrum oris and gangrenous
stomatitis. More than 30,000 cases are
estimated to occur annually, predominantly
in malnourished children in remote
parts of Africa; specialists suspect more go
unreported. WHO says the fatality rate is
as high as 90%, although early treatment
with antibiotics is effective. Researchers
think bacteria and poor oral hygiene help
cause the disease, which begins as an
inflammation of the gums and evolves into
a gangrene that damages the nose, mouth,
and eyes. Survivors face stigma, and some
need reconstructive surgery. WHO’s listing
comes after years of advocacy to raise
awareness about the disease, including a
January request from Nigeria and
32 other countries with significant numbers
of noma cases.


A 5-year-old girl in Nigeria, her face disfigured by noma,
travels with her mother to a hospital for treatment.


Fallen M.D. soon heads to prison 


CRIMINAL JUSTICE | Paolo Macchiarini, 
the former surgeon sentenced to 2.5 years 
in prison by a Swedish court for aggra- 
vated assault against patients he treated, is 
expected to begin his sentence sometime
in coming weeks. As described by a Netflix
documentary and the true crime drama
Dr. Death, three patients died after suffering
complications from experimental trachea 
transplants he performed at the Karolinska 
Institute; the court ruled that Macchiarini 
was criminally culpable because he knew 
the surgeries were unlikely to succeed and 
disregarded the risks to the patients. On
30 October, Sweden’s highest court turned
down his appeal, triggering a 90-day dead- 
line for authorities to determine when and 
where he will report to prison. Sweden does 
not publicly release such details. Observers 
say Macchiarini may be imprisoned in 
Spain, where he lives, because EU rules 
encourage allowing convicted people to 
serve their sentence in the country in which 
they are most likely to be successfully reha- 
bilitated and reintegrated into society. 


Compounds nearly diamond-hard 


MATERIALS SCIENCE | Researchers
reported this month synthesizing carbon
nitride materials with a hardness second
only to that of diamond, the hardest
known mineral. Predicted in the 1980s,
such a carbon nitride was first made by
a U.S. group in 2016, but the material fell
apart if not continually squeezed under
intense pressure. Scientists reported on
10 December in Advanced Materials that
by compressing carbon and nitrogen
precursors to pressures similar to those in
Earth’s mantle, they made three different
carbon nitrides, all of which proved to be
superhard and stable when the pressure
was released. The authors say it’s difficult
to synthesize more than the tiny amounts
they derived, but they speculate these crystals
could serve as seeds for growing larger
amounts of the material.


CLIMATE POLICY


Fossil fuel pledge panned


Nearly 200 countries agreed last week at
the United Nations's 28th Conference of
the Parties climate summit to “transition
away from” fossil fuels. Some observers
called it a political milestone. But many
climate scientists described the nonbinding
pledge—which asks countries to
develop new plans within 2 years to curb
greenhouse gas emissions, including
methane—as barely a step forward.


“It's like promising your
doctor that you will ‘transition
away from doughnuts’
after being diagnosed with
diabetes.”

Michael Mann, University of Pennsylvania,
in The Guardian


“With every empty promise,
millions more people will enter
the front line of climate change
and many will die.”

Friederike Otto, Imperial College London,
in The Guardian


NIH not auditing foreign grantees 


FUNDING | A federal watchdog office
has faulted the U.S. National Institutes of
Health (NIH) for failing to collect annual 
audit reports from foreign grantees. The 
agency received only 28 of 109 reports 
required from 55 of its grant recipients 
that received at least $750,000 in federal 
funding in fiscal years 2019 and 2020, 
according to the 14 December report by
the Department of Health and Human
Services’s Office of Inspector General. Most 
institutions lacking audits were in Europe, 
Canada, and Africa. NIH responded that
it plans to obtain the missing reports by
September 2024. In January, the inspector 
general found that NIH failed to adequately 
monitor grants to the nonprofit EcoHealth 
Alliance that included subawards to China’s 
Wuhan Institute of Virology—which some 
conservatives have accused, without evidence,
of sparking the COVID-19 pandemic.


Blown-out windows after
a 2012 shooting at Sandy 
Hook Elementary School in 
Newtown, Connecticut. 


NRA donations spike in school shooting counties 


Parallel surges for a gun control advocacy group not seen 


By Meredith Wadman 


School shootings are tragically familiar
in the United States. But a new analysis
shows they have a startling consequence.
In the very counties where
they occur, the National Rifle Association
(NRA) sees a surge in donation
amounts and in new donors that lasts several
years after shootings, according to an
analysis of 131 school shootings with at least 
one fatality between 2000 and 2022. 

The finding, published this week in Science 
Advances, echoes well-documented surges 
in gun purchases and donations to NRA, the 
leading U.S. gun rights lobbyist, after mass 
shootings. But the hyperlocal effect is new 
and revealing, researchers not involved in 
the study say. “Either there’s a fear ... that 
someone is going to take your gun away ... 
and/or a fear that you are somehow less safe 
and therefore need more guns,” says Kerri 
Raissian, who directs the Center for Advanc- 
ing Research, Methods, and Scholarship in 
Gun Injury Prevention at the University of 
Connecticut. “It says that there is something 
about this event happening in your county, 
local to you, that makes this threat more real.” 

Tobias Roemer, a Ph.D. student in politi- 
cal science at the University of Oxford, which 
funded the study, used a U.S. Federal Election 
Commission database to track donations after
school shootings in 131 counties as well
as in the 2758 counties without such school
shootings. He found that donations to NRA’s 
political action committee, the Political Vic- 
tory Fund, climbed on average $970 per year 
in the affected counties in the 4 years follow- 
ing a school shooting. The number of indi- 
vidual donors in those counties grew by as 
much as 40%. He found no parallel surges 
in NRA donors and donations in unaffected 
counties. At the same time, donations to the 
political action committee of Giffords, which 
lobbies for gun control and research, did not 
increase in affected counties. 

Roemer also found no increase in dona- 
tions to organizations unconnected to guns, 
such as antiabortion and abortion rights 
groups, which ruled out other causes of a 
more general giving increase, such as a tax 
benefit deadline. To judge whether donors 
were responding to active NRA solicita- 
tions, he examined the group’s advertising 
buys on Facebook and Instagram and found 
no increases in the weeks after shootings. 
Roemer says his paper “focuses on a tangible 
variable. Money is real. It’s not cheap talk.” 

The results come even as NRA, which did 
not respond to requests for comment, deals 
with an ongoing fraud lawsuit from New 
York state’s attorney general. 

The study “introduces a novel data set 
and a very compelling, straightforward ... 
analysis,” says Bradley Bartos, a criminologist
at the University of Arizona who
wrote an accompanying editorial. Laura
Garcia-Montoya, a political scientist at the 
University of Toronto, conducted a similar 
county-level study published in 2021 that 
found school shootings with multiple vic- 
tims drive increases in Democratic electoral 
support in those counties. The new paper “is 
showing the other side of the spectrum,” she
says, adding that the new findings “make a 
really powerful statement ... that ... people 
who support gun rights also feel threatened 
by school shootings and so they feel the 
need to help their interest group.” 

Some academic critics argued the ab- 
sence of a surge in donations to the Giffords 
organization does not mean the shootings 
had no effect on support for gun safety. The 
group was launched in 2013 and at that 
time had far lower name recognition than 
the 150-year-old NRA. And it represents 
only a slice of the gun control lobby, being 
one of three leading gun control groups 
that together outraised NRA by $10 million 
in the 2021-22 election cycle. 

“The paper unfortunately relies on ar- 
bitrary slices of data to reach obviously 
flawed conclusions,” says Chris Harris, vice 
president of communications at Giffords. 
“Knowledgeable observers on both sides of 
the gun debate know that the gun safety 
movement is growing exponentially while 
the NRA suffers from shrinking membership
rolls and shrinking bank accounts.”


‘Gene drive’ may help fight viruses with viruses


CRISPR-engineered herpesviruses can speed spread of genes to viral relatives in mice 


By Jon Cohen 


If your immune system or drugs can’t
stop a viral infection, why not pit a
virus against itself? That’s the provocative
idea several labs are pursuing.
They are studying whether deliberately
introducing engineered viruses
into people infected with their natural 
relatives can “drive” a foreign gene into 
those viruses that ultimately wipes out 
an infection. 

No lab has knocked down an infection in 
animals this way yet. But a group has now 
shown it’s theoretically possible, using a 
so-called gene drive that harnesses the ge- 
nome editor CRISPR to perform molecular 
surgery that spreads a gene among viruses. 

So far, efforts to add gene drives to ani- 
mals such as rodents and mosquitoes to 
control their numbers have drawn the most 
attention. In a preprint this month, how- 
ever, one team reports a similar feat with 
herpes simplex virus 1 (HSV-1) in mice. When both
engineered and unmodified herpesviruses 
were inoculated into the rodents, the gene 
drive converted up to 90% of the viruses— 
possibly enough to prevent an HSV-1 infec- 
tion from causing symptoms such as painful 
cold sores. A second group has succeeded in 
putting gene drives into HSV-1 that is grow- 
ing in infected cells in lab dishes. 

The viral gene drive work is a long way 
from curing an infected person. No one 
knows, for example, what kind of genetic 
modification the drive should propagate 
to drive down an infection. But other sci- 
entists see its potential. The new studies 
are “compelling,” says Rebecca Shapiro, 
a molecular microbiologist at the Uni- 
versity of Guelph who has experimented 
with gene drives in fungi. “This opens 
up a lot of exciting avenues to use these 
kinds of gene drive techniques to modify 
viral populations, and possibly use them as 
new therapeutics.” 

Researchers have developed gene drives 
mostly in animals that sexually reproduce. 
They’ve created genetically modified fe- 
males or males whose odds of spreading 
a gene to their offspring are significantly 
higher than 50%—the typical Mendelian 
chance that a descendant will inherit a 
particular gene variant. Most experiments 
aim to spread modifications that kill the 
offspring or render them sterile, a possible
strategy for pest control. This work has only
been done in labs so far, however, because 
of concerns that releasing animals with 
gene drives could have dire consequences, 
such as accidentally wiping out an entire 
species or harming animals that aren’t 
the target. 

Unlike people, viruses don’t have gene-scrambling
sex with each other when
they replicate. They simply command
infected cells to read their genes and
produce new viruses. But if multiple virions
infect a single cell—as happens with
herpesviruses—they often do something
akin to sex, randomly swapping genetic
sequences inside the nucleus. A gene drive
hijacks this natural process, known as recombination,
to introduce and spread genetic
changes that might ultimately disable
a whole population of viruses.

HSV-1, HSV-2, human cytomegalovirus
(hCMV), Epstein-Barr virus, and other
herpesviruses are particularly attractive
candidates for tackling with a gene
drive because they can cause lifelong, latent
infections that periodically flare and
cause symptomatic disease. Some drugs
can tamp down viral reactivation, but in
people who have compromised immune
systems, those drugs often fail and herpesviruses
run riot, damaging many parts of
the body and even causing death. This is a
particular problem with people who have
untreated HIV infections or receive organ
or bone marrow transplants.

Three years ago, virologists Marius
Walter and Eric Verdin at the Buck Institute
for Research on Aging demonstrated
that a gene drive might be feasible in a
herpesvirus. They showed in cell culture
experiments that hCMVs endowed with a
gene drive built from CRISPR could spread
the drive to other hCMVs in infected cells.
The gene drive mainly consisted of genetic
sequences coding for CRISPR’s two components:
a DNA-cutting enzyme known as Cas
and a guide RNA that directs the enzyme
to a specific place in a viral genome, where
it makes a break, enabling a copy of the
CRISPR-Cas genetic cassette to insert itself.
The researchers attached a gene for a fluorescent
marker to the complex so they could
track the drive as a chain reaction spread it
to yet more herpesviruses.


When multiple herpesviruses infect a cell’s nucleus (shown here), they sometimes exchange genes, a behavior a
“gene drive” might exploit to wipe them out.

The study also hinted that a gene drive 
could reduce, if not eliminate, a viral popu- 
lation. The guide RNA shuttled the drive 
to park inside a specific viral gene, one 
encoding a protein that protects the patho- 
gen from immune attack. The insertion 
disabled the gene, reducing levels of the 
protective protein and rendering the virus 
more vulnerable. 

The paper, which appeared in Nature
Communications in September 2020, received
scant attention. “Publishing a paper
during a pandemic that’s happening with 
another virus is not a great way of having 
people notice it,” says Walter, who was a
postdoctoral researcher in Verdin’s lab and 
has since moved to Keith Jerome’s lab at 
the Fred Hutchinson Cancer Center. 

The few researchers who did read it were 
skeptical, questioning whether a viral gene 
drive would cripple enough viruses to mean- 
ingfully reduce the population, Verdin re- 
calls. “One of the arguments that we heard 
is, ‘Oh, you’re never going to have enough of
the two viruses together in this same cell to
actually make it work.’” But the 8 December
preprint by Walter, Verdin, Jerome, and col- 
leagues now shows that in at least some in- 
fected mouse tissues, a gene drive can spread 
genes through much of the viral population. 

Walter and Verdin view their work as 
a proof-of-concept experiment, as does a 
team led by virologist Dai Hongsheng of 
China’s Southern Medical University and 
his colleagues, which describes its cell cul- 
ture success with an HSV-1 gene drive in a 
4 December preprint on bioRxiv. In tissue 
cultures, HSV-1 gene drives worked better 
than the hCMV ones, Dai notes. “The gene 
drive in HSV-1 spread steadily and at a 
speed not achievable with hCMV,” he says. 

But the mouse experiment suggests 
the HSV-1 gene drive needs to move even 
faster. As shown by the fluorescent marker 
gene it carried, the drive spread at differ- 
ent rates across the mice’s body regions, 
with the variation being most pronounced 
in the brain: In the cortex, the marker 
propagated to 25% of the viruses, whereas 
it ended up in as much as 90% of the vi- 
ruses in the brain stem. Nonetheless, 
Walter believes the results suggest the 
technology might one day help people. 
“You have enough coinfection occurring in 
vivo for a gene drive to be something that 
we can potentially use,” he says. 

All of the involved researchers stress that 
gene drives for herpesviruses are far from 
ready for tests in humans. Walter plans fu- 
ture experiments in animal models, hoping 
to show whether a gene drive can suppress 
a chronic HSV-1 infection enough to pre- 
vent reactivation and disease. 

Britt Glaunsinger, a biochemist at the 
University of California, Berkeley who 
specializes in studying herpesviruses, sees 
the promise of the concept, but cautions 
that the modified viruses could cause dis- 
ease themselves—especially in immuno- 
compromised people—if the gene drive 
does not sufficiently weaken the viruses 
it targets. Still, she wrote in an email, “I 
look forward to seeing where the field 
takes this!”


NIH advisory group pushes for
big pay raise for U.S. postdocs 


Though applauded by many, measure would strain lab budgets if adopted


By Katie Langin 


The postdocs who do much of the labor
of academic science have long felt overworked
and underpaid. If an advisory
group to the U.S. National Institutes
of Health (NIH) gets its way, conditions
for some of these early-career
scientists will soon improve. The minimum 
starting salaries of NIH-supported postdocs 
should increase to $70,000 in 2024, up from 
the current minimum of $56,484, the group 
concluded in a report released last week. “We 
feel that this is crucial to ... protect the U.S. 
biomedical competitiveness,” said co-chair
Shelley Berger, an epigeneticist at the Univer- 
sity of Pennsylvania, during the presentation 
unveiling the recommendations. 

The group also recommended a 5-year cap 
on postdoc positions; an expansion of health 
care, retirement, and child care benefits; in- 
creased support for international postdocs; 
and other measures to improve postdocs’ 
lot. But how lab heads would pay their staff 
more out of their constrained research grants 
remains an open question. An NIH spokes- 
person says the agency’s new director, 
Monica Bertagnolli, will carefully review the 
report with her leadership team. 

Salary gap

The minimum salary for starting postdocs funded through the National
Institutes of Health (NIH) has consistently been lower than the average
starting salaries for U.S. biological and biomedical Ph.D.s in other
sectors. A recommended increase would narrow the gap.

Many of the recommendations echo those
of a 2012 NIH advisory group. The urgency
is greater now, working group members say.
“This time will be different because the economy
is different,” says one, economist Donna
Ginther at the University of Kansas. “Poten- 
tial postdoctoral scholars are skipping the 
postdoc altogether because they can make 
more money [elsewhere].” 

The proposed salary increase was over- 
due, some postdocs say. “It has long been 
clear that systemic change is necessary,” says
University of California, Santa Cruz postdoc 
Neal Sweeney, who is president of the union 
that represents UC postdocs. Postdoc unions 
at UC and elsewhere have already won raises. 

Many PIs see the need for change, but are 
also left wondering how they would maintain 
their research programs given the higher 
postdoc salaries. After the presentation, sev- 
eral attendees worried that junior professors 
and underresourced institutions such as 
historically Black colleges and universities 
might particularly struggle. One attendee 
also wondered whether the proposed 5-year 
cap on postdoc positions was feasible given 
the “practicalities in the real world.” 

Working group co-chair Tara Schwetz, 
NIH’s deputy director for program coordi- 
nation, planning, and strategic initiatives, 
acknowledged the recommendations, if en- 
acted by NIH, “may result in fewer postdocs 
that are being supported.” Given the limits 
of the budget, “We know that 
there are priorities and tough 
decisions that are going to 
have to be made.” Schwetz 
also emphasized that “it’s 
critical to engage other gov- 
ernmental agencies, organi- 
zations, institutions—they’re 
going to be key partners in 
success.” 

At the end of the pre- 
sentation, Bertagnolli em- 
phasized that NIH took 
seriously its responsibility to 
“think long and hard” about 
unanticipated consequences 
of the proposed recommen- 
dations. “This is a big step. 
I completely agree a critical 
one. But we just want to do 
no harm.” 


San Andreas fault may shake
less than once feared 


Precariously balanced rocks near Los Angeles hold clues 
to the hazard of a giant earthquake 


By Paul Voosen, in San Francisco 


Someday, a great earthquake will erupt
from the San Andreas fault, which cuts 
through Southern California from Los 
Angeles to San Francisco. Geologic re- 
cords make it clear. It has happened, 
and it will happen again. 

But when the Big One does hit, it may be 
less devastating than once thought, at least 
near Los Angeles. According to new work 
presented last week at a meeting of the 
American Geophysical Union, the ground 
there will shake up to 65% less violently than 
official hazard models suggest. 

The good news for Angelenos stems from 
five rocks balanced precariously on top of 
other rocks in Lovejoy Buttes, a place in 
northern Los Angeles County that sits just 
15 kilometers from the fault. By dating when 
the rocks first became fragile and analyzing 
their structures to assess the maximum shak- 
ing they could withstand, the researchers 
could test official predictions against thou- 
sands of years of earthquakes. Those predic- 
tions have been found wanting, says Anna 
Rood, a seismic hazard scientist at the Global 
Earthquake Model Foundation who led the 
work, which is accepted in Seismological
Research Letters. “The hazard estimates are 
totally inconsistent with these precariously 
balanced rock data.” 

The new study is a welcome advance for an 
emerging technique, says Daniel Trugman,
a seismologist at the University of Nevada, 
Reno. “They’ve applied probably the most 
rigorous methodology that I’ve ever seen to 
try and solve this problem.” 

“This is good data,” adds Mark Petersen,
a geophysicist at the U.S. Geological Survey 
(USGS) who leads development of the U.S. 
National Seismic Hazard Model. It’s too early 
to incorporate the findings when the hazard 
maps are updated next month, Petersen says, 
especially given the maps’ conservative na- 
ture. But if the findings hold up, Rood says, 
“There are real implications for the people 
living in this area.” Seismic hazard maps go 
into building codes, disaster preparation, and 
land use planning. And if you insure a house 
in California, she says, “They dictate what 
your premium will be.” 

For as long as scientists have sought to 
calculate seismic hazard, they have struggled 
to predict the shaking caused by the largest 
earthquakes, the one-in-10,000-year disasters 
evident in the geologic record but never expe- 
rienced in modern times. Paleoseismic tech- 
niques offer limited help. Subtle sediment 
shifts seen in trenches dug across the faults 
record the ancient quakes but say nothing 
about the shaking that occurred, which is a 
complex function of the way a fault ruptures 
and the local geology. So seismologists con- 
struct their shake maps based on extrapola- 
tions from historical quakes. “The maps come
from the best available science, but there’s a
lot of uncertainty in them,” Trugman says.

Stickers aid the digital re-creation of precarious
boulders at Lovejoy Buttes, near Los Angeles.

The precarious rocks, however, actually 
witnessed the ancient quakes—and kept 
standing. Wind and water have eroded these 
granite rocks, often into hourglass shapes in 
which boulders perch on top of platforms. 
During trips in 2021 and 2022, Rood and her 
co-authors chiseled surfaces up and down the 
rocks and dated them by measuring levels of 
beryllium-10, an isotope created when rocks 
are exposed to the atmosphere and a steady 
bombardment of cosmic rays. By extracting 
dates from the narrow neck where the boul- 
ders meet their platform, they could deter- 
mine when the rocks first became precarious. 
One rock, it turned out, had been teetering in 
its delicate position for 50,000 years. “That 
this rock has been recording and surviving 
these earthquake events for 50,000 years is a 
crazy amount of data,” Rood says. 

Like actors in a motion-capture movie, 
the rocks were also tagged with stickers so 
their shapes could be digitally re-created. By 
analyzing the shapes in structural models, 
the team calculated how much shaking they 
could take without toppling. 

In 2020, Rood had conducted an analysis 
of precarious rocks along the Hosgri fault 
near California’s Diablo Canyon Power Plant 
to gauge the earthquake threat to the plant’s 
nuclear reactors. There, too, she found that 
the hazard was overstated. The large and 
repeated disagreements with the USGS haz- 
ard model suggest researchers may need to 
“go right back to the basics of earthquake 
science and check our fundamental under- 
standing,” she says. 

The use of fragile features to probe past 
earthquakes is sure to grow. New Zealand 
has already used similar balanced rocks to 
lower its estimate of the hazard at Clyde Dam 
in Otago, the nation’s third largest hydro- 
electric dam. The International Atomic 
Energy Agency will suggest using frag- 
ile features in its new seismic guidelines. 
And Devin McPhillips, a USGS earthquake 
geologist, is exploring how sea stacks, ghostly 
remnants of lost sea cliffs that are prone to 
collapse, can track the violence of ancient 
earthquakes along the Cascadia subduction 
zone in the Pacific Northwest. 

Once these analyses mature, they will be 
able to weed out underperforming ground 
motion models, McPhillips says. And for 
now, Californians can enjoy a bit of positiv- 
ity. “Big earthquakes do happen and will 
happen again in Southern California. I’m 
not looking forward to that day,” he says.
“But maybe I can hold out hope that it won’t 
be quite as damaging.”

Before tunnel collapse, scientists sent warnings


Researchers repeatedly raised concerns about building in Himalayas’ unstable geology 


By Athar Parvaiz 


Last month, government officials in India
celebrated after rescue teams freed
41 construction workers who had been
trapped underground for 17 days when
a rock fall sealed a tunnel they were
building beneath mountains in northern
India. “Everyone involved in this mission
has set an amazing example of humanity and 
teamwork,” Indian Prime Minister Narendra 
Modi said in a statement posted on the social 
media platform X (formerly Twitter) after the 
28 November rescue. 

For many Indian researchers, however, 
the near-disaster was the predictable out- 
come of officials ignoring repeated warnings 
from scientists about the dangers of trying to 
carve what is called the Char Dham National 
Highway through the unstable geology of the 
Himalayas. They are hoping the collapse— 
and problems related to other infrastructure 
projects—will persuade the government to 
reconsider building practices, and strengthen 
a legal challenge to the Char Dham project
currently before the nation’s Supreme Court.
“The repeated disasters ... should prompt
the government to re-evaluate construction
plans and institute preventive measures,”
says environmental scientist Anjal Prakash
of the Bharti Institute of Public Policy at the
Indian School of Business.

Pilgrim’s path

Much of the Char Dham road project is designed
to ease access to Hindu pilgrimage
sites in Badrinath, Kedarnath, Gangotri,
and Yamunotri.

The Char Dham road has fueled contro- 
versy since 2016, when the Modi govern- 
ment made the $1.9 billion effort to upgrade 
900 kilometers of roads connecting four pop- 
ular Hindu pilgrimage sites the centerpiece 
of a major economic development project. 
The plan called for widening roads, boring 
tunnels, and building more than 100 bridges 
and 3500 culverts in rugged terrain. It re- 
quired extensive blasting, clearing forests, 
dumping tons of waste, and slicing massive 
roadcuts into steep hillsides—all activities 
that could trigger rockfalls, landslides and 
floods. But legal analysts say the government 
sidestepped requirements for a thorough en- 
vironmental assessment by cutting the proj- 
ect into segments less than 100 kilometers 
long, below the threshold for review. 


Ambulances await the rescue of 41 workers trapped in the Silkyara tunnel in late November. 

In 2019, however, legal challenges from
environmental groups prompted the Supreme
Court to appoint a 25-member High Powered 
Committee, led by Ravi Chopra, a prominent 
environmental scientist, to review the envi- 
ronmental impacts of the project, which was 
already underway. In a 2020 report, the panel 
reported that engineers had failed to analyze 
the geology and hydrology at numerous con- 
struction sites, leading to landslides and envi- 
ronmental damage. But the group delivered a 
split verdict on one contentious issue: the ap- 
propriate width of the pavement. A majority 
supported pavement 10 meters wide, whereas 
a minority led by Chopra recommended pav- 
ing be limited to 5.5 meters to reduce safety 
risks and environmental harm. 

After India’s military argued it needed 
the broader roads to defend India’s border, 
the Supreme Court in 2021 backed 10-meter 
pavement for much of the Char Dham proj- 
ect. In early 2022, Chopra resigned from the 
panel, writing that the experience had “shat- 
tered” his belief that the panel “could protect 
this fragile ecology,” and that he suspected
construction officials would continue to ignore
the panel’s technical recommendations.

Other infrastructure-related disasters
heightened such fears, says Navin Juyal, a 
former senior geologist at India’s Physical 
Research Laboratory. For example, he says 
officials have built hydropower dams and 
other facilities in recently glaciated valleys 
above 2000 meters, despite warnings from 
scientists that such areas can be geologically 
unstable and prone to flooding (a warning 
echoed by the High Powered Committee). 
In 2021, a glacial outburst killed more than 
140 workers at one such project, the Tapovan 
Vishnugad Hydropower Plant. 

Worries about Char Dham have grown in
recent years, especially after rains triggered 
landslides that caused fatalities. A study of 
one 250-kilometer-long segment in Uttara- 
khand found that, after heavy rains in late 
2022, widened areas were twice as likely to 
experience slides as unwidened stretches.
That is likely because roadcuts, blasting, and 
vegetation removal made slopes more likely 
to fail, the authors wrote in the preprint 
posted on the EGUsphere server in October. 

Given that history, last month’s collapse of 
the 4.5-kilometer-long highway Silkyara tun- 
nel in the state of Uttarakhand came as no 
surprise to seismologist C. P. Rajendran of 
the National Institute of Advanced Studies. 
The tunnel “is itself an example of shoddy 
construction that ignored the preliminary 
geological report that warned against the 
dominating presence of weak unstable rock 
wedges in the region,” he says. 

India’s Ministry of Road Transport and 
Highways did not respond to a request for 
comment. But government officials have told 
Indian media that engineers were aware the 
tunnel traversed a “shear zone” of brittle rock. 
Earlier this month, the ministry appointed 
a five-member panel to investigate the col- 
lapse and report by the end of the month on 
needed safety measures for the Char Dham 
project, which is nearly 70% complete. 

Rajendran is skeptical that the disaster will 
catalyze major changes in the government’s 
plans. But the tunnel collapse “has attracted 
huge public interest,” he says. “There is a real- 
ization that a disaster-prone road will neither 
serve any national security purpose nor make 
the pilgrimage safer.” And he hopes the event 
will persuade the Supreme Court, which is 
hearing a challenge to the Char Dham proj- 
ect, to impose stiffer construction rules. 

Environmental advocate Mallika Bhanot 
of Ganga Ahvaan, a citizens’ group, says the
tunnel collapse should teach a broader les- 
son: India’s Himalayan region “can with- 
stand no more unscientific development.” 


Athar Parvaiz is a journalist based in New Delhi 
and Srinagar. 

ANIMAL BEHAVIOR


Chimps remember faces of old 
friends and family for decades 


Recognition ability rivals all other animals, including humans 


By Ann Gibbons 


Louise hadn’t seen her sister or nephew
for 26 years. Yet the moment she spot- 
ted them on a computer screen, she 
recognized them, staring hard at their 
faces. The feat might have been impres- 
sive enough for a human, but Louise is 
a bonobo—one who had spent most of her life 
at a separate sanctuary from these relatives. 

The discovery, published this week in the 
Proceedings of the National Academy of Sci- 
ences, reveals that our closest primate cous- 
ins can remember the faces of friends and 
family for years, even decades. The study, ex- 
perts say, shows that the capability for long- 
term social memory is not unique to people, 
as was long believed. 

“It’s a remarkable finding,”
says Frans de Waal,
a primatologist at Emory 
University who was not in- 
volved with the work. “I’m 
not even sure we humans 
remember most individu- 
als we haven’t seen for 
2 decades.” The research, 
he says, raises the possibil- 
ity that other animals can 
also do this, remember- 
ing far more than we give 
them credit for. 

Trying to figure out 
whether nonhuman primates remember 
a face isn’t simple. You can’t just ask them. 
So comparative psychologist Christopher 
Krupenye at Johns Hopkins University and 
colleagues used eye trackers, infrared cam- 
eras that noninvasively map a subject’s gaze 
as they look at images of people or objects. 

The scientists worked with 26 chim- 
panzees and bonobos living in four zoos or 
sanctuaries in Europe and Japan. The team 
showed the animals photos of the faces of 
two apes placed side by side on the screen 
at the same time for 3 seconds. Some images 
were of complete strangers; some were of 
close friends, foes, or family members who 
had once lived in their same social groups, 
but whom they hadn’t seen in years. 

The chimpanzees and bonobos gazed 
about a quarter-second, or 11% to 14%, longer 
at the faces of apes they used to know and
associated with good times, compared with
the faces of strangers. In some cases, the
animal hadn’t seen the ape depicted on the
screen for 5 or 10 years—or, in Louise’s case,
for more than a quarter-century.

Chimpanzee twins Golden and Glitter
groom each other in Tanzania.

The flicker of recognition was quick but 
consistent, and matches findings with human 
infants, who gaze longer at faces of people 
they know. “It’s like when you pass some- 
one on the street you hadn’t seen for years 
and you do a double take,” says the study’s 
lead author, Laura Lewis, an evolutionary 
anthropologist at the University of Califor- 
nia, Berkeley. The animals looked longest at 
the images of apes with whom they had had 
positive relationships, such as former group 
mates who had groomed them or played with 
them. They did not linger 
as long on former sexual 
partners, distant acquain- 
tances, or individuals who 
had hurt or scared them. 

Other animals, such as 
dolphins, elephants, and 
dogs, use sound or smell 
to recognize another fa- 
miliar individual’s call or 
scent. But the ability to 
use visual cues to remem- 
ber a friend’s face years af- 
ter parting has long been 
seen as unique to humans 
and critical to our ances- 
tors’ survival, de Waal says. Humans who 
recognized a friend or foe could renew alli- 
ances and trade food or other goods in hard 
times, he says. Chimpanzees form large, hi- 
erarchical groups, which often break up into 
smaller groups whose members come and go, 
sometimes with power struggles. So it makes 
sense that remembering friends would ben- 
efit chimps as well, de Waal says. 

Young apes go through intense phases of 
social learning, similar to human teens, says 
Rachna Reddy, a biological anthropologist 
at the University of Utah who was not in- 
volved with the work. That may help them 
make distinctions about whom to trust, 
she says, particularly for young females in 
preparation for when they leave their birth 
group. This study, she says, opens “a win- 
dow into the incredibly rich interior emo- 
tional life of chimpanzees.” 

Ettore Majorana theorized a particle that
was its own antiparticle—an idea that could 
bolster fragile quantum computers. 

THE QUANTUM PHANTOM


A ghostly quasiparticle rooted in a century-old Italian mystery could unlock 
quantum computing’s potential—if only it could be pinned down 


Reclusive and disaffected, Ettore
Majorana liked to work in the shad- 
ows. But after his friend Emilio 
Segrè dragged him into Enrico
Fermi’s elite Roman physics club 
in the late 1920s, Majorana’s 
stature in atomic physics quickly 
grew. His mostly unpublished pre- 
monitions were eerily prescient: 
Among others, he famously intuited the 
existence of the neutron from prior experi- 
ments. And in 1937, he conjured up a com- 
pletely new kind of particle. Physicists had 
learned that every fundamental particle 
seems to have an antimatter counterpart, 
an idea Segrè would later earn a Nobel
Prize for verifying. Majorana realized the 
equation that leads to this duality could 
also describe a single particle with identical 
matter and antimatter personas, making it 
prone to annihilate itself. 

Months later, the 31-year-old withdrew a 
large sum from his bank account, took a boat 
across the Tyrrhenian Sea, and vanished. To 
this day, nobody’s sure what happened to 
him, and the jury is also still out on whether 
his proposed particle exists. For example, 
some physicists still believe the neutrino—a 
wispy particle that pervades the universe— 
might be its own antiparticle, which could 
also help explain why the universe is filled 
with more matter than antimatter. But tests 
have so far proved inconclusive. 

However, scientists think they are close 
to approximating a Majorana particle in 
a much different guise. By confining elec- 
trons on flat surfaces, researchers can coax 
them into peculiar dances that collectively 
masquerade as a Majorana particle, much 
as the undulations of a flock of birds might 
appear like a swimming fish. 

By Zack Savitsky,
in Santa Barbara, California

Tiny superconducting loops power
Google’s Sycamore quantum chips, but they
are plagued with errors.

If this Majorana quasiparticle could be
corralled and controlled, the payoff would
be twofold. It’s a chance to manifest exotic
physics, a thrill on par with smashing out
new particles in a collider. More tangibly,
the electron dances also promise to fortify
quantum computers, transforming the
frail devices into practical machines that 
can realize their promise of vastly speed- 
ing up certain computations and simulat- 
ing the true nature of reality. “It’s such a 
surprising thing that electrons will do this,” 
says Charlie Marcus, a University of Wash- 
ington physicist and veteran of Majorana 
research. “And if it turns out that it’s really 
beneficial for quantum computing ... that 
feels like frosting on the cake.” 

The epicenter for the Majorana hunt 
is here, in this cushy Southern Califor- 
nia beach town that has seen an influx of 
hundreds of well-paid corporate scien- 
tists. Inside a stark black building with 
tinted windows, Google researchers at- 
tempt to engineer the quasiparticle on 
supercooled chips encased in chilling sys- 
tems that look like chandeliers. Across the 
parking lot, Microsoft researchers work 
to lure it to the ends of a wire 1000 times 
thinner than a strand of hair. Down the 
street, scientists at the University of Cali- 
fornia, Santa Barbara (UCSB) aim to trap it 
with pencil lead and Scotch tape. They’re all
pretty sure they’ve got it—and that they’re
one step ahead of everybody else. 

But like the 85-year hunt for Majorana 
himself, the field has yet to deliver on tan- 
talizing leads. Contentious false alarms 
and a muddy mix of corporate and aca- 
demic interests have left researchers raw 
and bruised—even as they say they are on 
the cusp of a breakthrough. “It’s a field 
that has had some hiccups,” says Jason 
Alicea, a theoretical physicist at the Cali- 
fornia Institute of Technology (Caltech). 
“But there’s a lot of new tendrils that are 
opening up. There’s so much beautiful 
physics that’s waiting to be revealed.” 


AT AN ATOMIC LEVEL, 2D slices of solid ma- 
terials can be thought of as giant checker- 
boards in which the pieces are electrons and 
the vacant spaces are “holes,” which behave
like positively charged electrons. Under the 
right conditions, the motion of electrons and 
holes can synchronize to behave as a single 
quasiparticle, in which fragments of the elec- 
tron appear as if they are physically sepa- 
rated. These quasiparticles are sometimes 
dubbed “anyons” for their seeming ability to 
do anything. 

A subset of these anyons, for instance, can 
perform truly unforgettable dances. Split an 
electron into two virtual pieces and swap 
their places, and you'll subtly change their 
identities. When brought back together, 
the fragments might merge into a single 
electron—or they might disappear into the 
vacuum. Shuffling them shifts the probability 
that they'll converge to make something or 
nothing, and the outcome depends on all the 
prior dances they’ve ever shared with other 
partners. In this sense, twirling these par- 
ticles leaves a trail of their past movements, 
much like braiding hair, in which the order 
of operations is preserved even if the braid 
is twisted or tugged. The particles’ history is 
woven into the system’s topology, or math- 
ematical shape. 
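
The order dependence described above can be made concrete with a toy calculation. The sketch below is an illustration only and is not taken from the reporting: it assumes a two-level system whose basis states stand for the two possible merge outcomes (index 0 for "nothing," index 1 for "an electron"), and it uses 2-by-2 exchange matrices of the form commonly quoted for Ising-type anyons, with overall phase factors dropped. The names EXCHANGE_A, EXCHANGE_B, and outcome_probabilities are hypothetical.

```python
import math

S = 1 / math.sqrt(2)

# Assumed toy exchange operations (overall phases omitted).
# EXCHANGE_A swaps the two halves of one pair; EXCHANGE_B swaps
# neighbors that belong to different pairs.
EXCHANGE_A = [[1, 0], [0, 1j]]
EXCHANGE_B = [[S, -1j * S], [-1j * S, S]]


def apply(matrix, state):
    """Multiply a 2x2 matrix by a 2-component state vector."""
    return [sum(matrix[i][k] * state[k] for k in range(2)) for i in range(2)]


def outcome_probabilities(sequence, start=(1, 0)):
    """Apply exchanges in list order, then return the probabilities
    of merging into 'nothing' (index 0) or 'an electron' (index 1)."""
    state = list(start)
    for exchange in sequence:
        state = apply(exchange, state)
    return [abs(amplitude) ** 2 for amplitude in state]


# The same three exchanges, performed in two different orders.
first = outcome_probabilities([EXCHANGE_A, EXCHANGE_B, EXCHANGE_B])
second = outcome_probabilities([EXCHANGE_B, EXCHANGE_A, EXCHANGE_B])
print("A, B, B ->", [round(p, 3) for p in first])
print("B, A, B ->", [round(p, 3) for p in second])
```

Both sequences use the same three exchanges, yet in this toy model the first leaves the pair certain to merge into an electron while the second leaves the outcome at even odds. That is the sense in which the history of the dance, not just its ingredients, determines the result.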

Google engineers huddle under a cooling system designed to chill quantum chips to temperatures colder than the depths of space.


Thanks to a paper in 1997 by the physicist 
Alexei Kitaev, then at Russia’s Landau Insti- 
tute for Theoretical Physics, this strange elec- 
tron choreography became the blueprint for 
a new kind of computer.

Traditional computers rely on transistors, 
switches that turn electricity off and on,
encoding either a 0 or a 1—a “bit” of information.
Connect these switches in the right way,
and the machine can run any conceivable 
calculation, although exceptionally complex 
tasks could take billions of years. 

Quantum computers promise to speed up 
the process by exploiting the strange rules 
of quantum physics. If your quantum-bit, or 
“qubit,” is made from an object like an atom, 
it can embody many different superpositions 
of 0 and 1 states. A relatively small array of 
qubits, linked through a process called en- 
tanglement, can encode a vast number of 
possible computations. For certain questions, 
scientists can manipulate those qubits using 
algorithms that boost the probability of them 
settling on the correct answer. 
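
To give a rough sense of why a small array of qubits can encode so much, the sketch below counts the numbers needed to describe a register and builds one simple entangled state. It is a minimal illustration under the standard textbook assumption that n qubits are described by 2^n amplitudes; the function names are hypothetical and nothing here models a real device.

```python
import math


def uniform_superposition(n_qubits):
    """Amplitudes of n qubits in an equal superposition of all bit strings."""
    dim = 2 ** n_qubits
    amplitude = 1 / math.sqrt(dim)
    return [amplitude] * dim


def probabilities(state):
    """Probability of observing each bit string when the qubits are measured."""
    return [abs(a) ** 2 for a in state]


# A two-qubit entangled (Bell) state over the outcomes 00, 01, 10, 11:
# only 00 and 11 can be observed, so the two qubits always agree.
bell = [1 / math.sqrt(2), 0.0, 0.0, 1 / math.sqrt(2)]
print("Bell state probabilities:", [round(p, 3) for p in probabilities(bell)])

# The description of the register doubles with every added qubit.
for n in (1, 2, 10, 50):
    print(n, "qubits ->", 2 ** n, "amplitudes")
```

The doubling is the point: describing 50 qubits already takes more than a quadrillion amplitudes, which is why classical simulation becomes impractical long before a quantum register gets large.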

In theory, tasks such as simulating chemi- 
cal interactions and factoring large num- 
bers (the basis for modern cryptography) 
suddenly become a lot more manageable— 
making quantum computers both a lucrative 
gadget and a national security threat. 

The catch is that qubits are fragile. Envi- 
ronmental noise can easily disturb them, 
leading to erroneous changes in their prop- 
erties or a premature collapse to a definite 
0 or 1 state. Because the qubits are entangled,
these errors spread like wildfire, and the en- 
tire computation goes kaput. 

Standard quantum firefighting calls for re- 
dundant qubits to correct the errors, but that’s 
proving to be an epic challenge. The hearti- 
est proto-quantum computers today boast 
hundreds of qubits, far short of the millions 
or billions they will probably need to operate 
error-free. “It is turning out to be an engi- 
neering nightmare,” says Sankar Das Sarma, 
a theoretical physicist at the University 
of Maryland. 

Kitaev proposed a clever workaround 
to the problem. In his scheme, the qu- 
bits would consist not of atoms, but those 
special anyon pairs, which can capture 
quantum information in their dissociated 
electrons. By entangling multiple anyons 
and nudging them through the braided se- 
ries of dances, you can perform calculations 
that settle on the right answer—which is fi- 
nally read out by revealing whether the any- 
ons merged into an electron (a 1) or nothing 
(a 0). Because the electron is effectively split 
and doesn’t exist in any one place, it is im- 
pervious to local disturbances. 

The challenge then becomes how to get an 
electron to scatter its tracks so deceivingly. 
The simplest way is to link an electron and 
a hole so perfectly that they form two sepa- 
rate entities that are each half-electron and 
half-hole. These quantum yin-yang particles, 
both identical and opposite, much like the
particles Majorana posited, would have no
energy and no charge. With zero energy, the 
paired quasiparticles would rest securely in a 
gap between the positive and negative energy 
levels where electrons and holes can exist, of- 
fering extra protection from the noisy world. 

Kitaev’s idea of harnessing the quasi- 
particles’ built-in error protection is now 
known as topological quantum computing, 
and some believe it’s the technology’s best 
hope. Without the Majorana at its core, 
“I personally do not see any hope ever of
building a true fault-tolerant quantum computer,”
Das Sarma says. “Some revolutionary
advance is needed.”

Around 2004, UCSB mathematician Mike 
Freedman wrote a letter directly to Micro- 
soft founder Bill Gates promoting the idea of 
topological quantum computing. A year or so 
later, the company boldly bought in and es- 
tablished Station Q, initially just a five-person 
team that met in a seminar room inside the 
hexagonal oceanfront tower of the presti- 
gious Kavli Institute for Theoretical Physics, 
which juts out from the UCSB campus like 
a lighthouse. With a seemingly limitless bud- 
get and free range to explore wild ideas, the 
group was the envy of the fledgling world of 
topological computing, says Zhenghan Wang, 
a UCSB mathematician and founding mem- 
ber of Station Q. “It was the perfect place.” 


BEFORE ETTORE MAJORANA vanished in 
1938, he sent a series of cryptic messages to 
his family and colleagues about his impending
“sudden disappearance,” leading many
to suspect he’d taken his own life. But over 
the following years, purported sightings of 
Majorana multiplied: as a beggar in Naples, 
a monk in Calabria, and a vagabond in South 
America. The search for the ghost of his par- 
ticle has been similarly confounded by im- 
postors. “There are many, many papers ... all 
of them claim it’s Majorana,” Das Sarma says.
“None of them is Majorana.” 

Microsoft’s opening strategy was to take 
advantage of the fractional quantum Hall 
effect, a peculiar behavior of electrons in 
2D systems. Under a high magnetic 
field, electrons in the material take 
on specific energy levels, and each 
level can hold a maximum num- 
ber of electrons. The electrons re- 
arrange themselves to avoid one 
another, creating a sort of quantum 
fluid with properties that depend on 
how large a fraction of a given en- 
ergy level is filled. If you half-fill the 
second energy level, for instance, a 
slight change in the magnetic field 
should induce ripples in the fluid 
that seem to be quarter-electron 
charges, each of which is associated 
with a Majorana quasiparticle. 

Microsoft researchers began to look 
for these fractional charges in samples 
of gallium arsenide, a compound of- 
ten used in solar cells. But after years 
of trying, scientists couldn’t get the 
material to cooperate: The Majorana’s 
protective energy gap was too small, 
and applying electrical gates to wran- 
gle the quasiparticles into any useful 
dance was too challenging. 

Just as hope started to wane 
for the quantum Hall approach, a 
new proposal emerged: wrapping a 
superthin nanowire in a fancy super- 
conducting jacket, which allows the 
electrons inside to link up in a long, 
single-file chain of pairs. If there’s 
an odd number of electrons in the 
wire, the unpaired electron should 
essentially split in half to form two 
separated Majorana quasiparticles— 
one at each end of the wire. If you then lay 
multiple wires across each other in lattice- 
like frameworks, you can get the Majoranas 
to dance and braid by having them hop be- 
tween intersections. 

The Microsoft team readily changed 
horses and began to fund nanowire research. 
In 2012, one team, led by Leo Kouwenhoven, 
a physicist at the Delft University of Techno- 
logy, varied the voltage across the tip of their 
wire from negative to positive values and re- 
ported seeing a small peak in conductance 
at zero (evidence, they thought, of Majorana
pairs residing at zero energy). But theorists
soon realized other less interesting electron 
quasiparticles could also hover around zero 
energy and mimic that same signature. It was 
the first of many Majorana mirages. 

Some theorists argued that the real 
Majorana would have a clear signature: a 
zero-energy conductance peak at a specific 
value. In 2018, Kouwenhoven’s group measured
that mark, which at the time was considered
a “smoking gun,” he says. Microsoft
crowned Kouwenhoven the founding director
of its new campus in Delft. The company’s
vice president predicted they’d have a
working topological qubit later that year and
urged the Department of Energy to invest in
the blossoming “quantum economy.”

Technicians extract thin graphite layers with Scotch tape (bottom),
then carve electrical gates (top) to coax Majoranas into qubits.

But then the bubble popped. The signal 
was again shown to be inconclusive. Soon 
after, Kouwenhoven’s paper fell under scru- 
tiny for alleged data manipulation and in 
2021 was retracted by Nature, which cited 
a combination of confirmation bias and in- 
strumental miscalibration. In the following 
years, two similar claims from other groups 
were retracted by Nature and Science, and a 
fourth paper led by Marcus received an editorial
expression of concern in Science.

The reason researchers kept getting fooled, 
Kouwenhoven maintains, is that they focused 
on finding certain signatures without fully 
understanding the underlying mechanisms. 
“We jumped over a huge amount of physics,” 
he says. Other scientists suggest the prob- 
lem runs deeper, pointing to the pressure to 
deliver results and publish in high-impact 
journals. Chris Palmstrøm, a UCSB physicist
who manufactures nanowires for many
experiments, traces it to Microsoft’s over- 
zealous push to demonstrate a topological 
qubit. “They were trying to run before they 
could crawl,” he says. “It was perhaps
too early to promise all of this.” 

Around the time many of these 
claims fell apart, Microsoft had a 
culture change. Chetan Nayak, a Mi- 
crosoft physicist since the dawn of 
Station Q, took over from Freedman 
as engineering lead with a business- 
oriented vision that would focus on 
commercialization while guarding 
against intellectual property leaks. 
“We're not in a mode of explora- 
tion,” Nayak says. “You need to have 
everyone focused on this journey, 
and that’s not going to happen with 
a bunch of academic labs.” Microsoft 
eventually cut most of its funding 
for its university collaborations and 
moved the bulk of its quantum com- 
puting research in-house. 

The shift drove many big names out 
of the game. “Early in the academic- 
corporate quantum space ... we just 
had the best of both worlds,” laments
Marcus, who left Microsoft in 2021. 
“Then we grew up and the corporate 
experience changed.” Kouwenhoven 
departed the following year. Earlier 
this year, Freedman also left Mi- 
crosoft, and he recently joined the 
competing Google team across the 
parking lot. 

Fiona Burnell, a theoretical physi- 
cist at the University of Minnesota 
Twin Cities, credits Microsoft’s invest- 
ments with helping lure her into the 
field, but says the company’s secrecy 
complicates replication efforts. “It’s a double-edged
sword,” she says. Microsoft’s latest
Majorana publication in June, for example, 
omits recipes for fabricating nanowires, 
which the company regards as trade secrets. 
“Of course, I wish that they would maintain 
more openness with the academic community,”
Alicea says. “But maybe it’s impossible.”


A search for tenacious qubits

Majorana quasiparticles, made from pairs of fractional electrons, could in theory embody the quantum bits (qubits) in an error-free quantum computer.
The physical separation of the fractional electrons should protect the quantum information they hold from heat and noise, and computations occur by swapping or
braiding their positions. Researchers are hunting for the quasiparticles in three systems.

Nanowires (diagram labels: fractional electron, nanowire, superconducting jacket, paired electrons, semiconductor wire)

Pairing up electrons within an ultrathin wire can leave
an unpaired electron at either end that virtually splits
to form Majorana quasiparticles at each end. Shunting
them along a nanowire lattice creates braids in the
original Majorana pairs.


IN 2011, the public prosecutor’s office in
Rome reopened its investigation into
Ettore Majorana’s disappearance. Detectives
had come across an alleged photograph of
Majorana from 1955 in Venezuela, where he
had supposedly been living under the alias 
“Mr. Bini.” They attempted to match the 
man’s facial structure to an old photo of Ma- 
jorana and concluded that he had run away 
to South America, perhaps because he fore- 
saw how his colleagues’ work on atomic en- 
ergy could lead to nuclear weapons. Content 
with the explanation, the prosecutor’s office 
officially closed the case in 2015. But many 
were left dissatisfied. Francesco Guerra, a 
physicist at the Sapienza University of Rome 
who continues to investigate the case, says 
the conclusion was “completely ridiculous.” 

Majorana researchers, similarly vexed by
the low standard of evidence, are now pushing
to raise the bar. “A photo is not enough,”
Kouwenhoven says. “We need DNA evidence.”
The growing agreement is that the only conclusive
signature of Majoranas would be to
get them to work as qubits. “It’s braiding or
bust,” says Andrea Young, a UCSB physicist.

The present search for Majoranas is proceeding
along three primary avenues. The
first is the continued effort in nanowires, 
a torch largely carried by Microsoft. In 
their June paper, Microsoft researchers re- 
ported progress in making their nanowire 
Majoranas more robust, a feat Nayak lik- 
ened in a LinkedIn post to “inventing steel.” 
The Majorana’s protection depends on the 
size of the gap between the quasiparticles’ 
zero-energy state and the electrons’ closest 
available energy level. If the gap is too small,
tiny temperature variations will inject energy
that can either destroy the Majoranas or create
unwanted quasiparticles that might mess
with the Majorana signal, particularly if the
wire’s ends aren’t sufficiently separated.

Graphene (diagram labels: fractional electron, braiding, graphene, carbon lattice)

When magnetic fields are applied to graphene, a single-atom
layer of carbon, electrons and positively charged
“holes” can virtually split into Majorana pairs. The pairs
may be more immune to heat than those in nanowires
but harder to control.

1346 22 DECEMBER 2023 • VOL 382 ISSUE 6677

The Microsoft researchers measured 
the energy gap of their nanowires at 
30 microelectronvolts, above the 10 µeV
needed for them to be confident the signal is 
real. The next steps are to show that a pair of 
Majoranas can merge to a 0 or 1 state and to 
braid two Majorana pairs into a functional 
qubit. Nayak believes his team can do both. 
“Internally, we’re very convinced,” he says. 
“We're kind of off to the races with this.” 

The community, however, is largely with- 
holding applause. Some continue to shame 
Microsoft for safeguarding fabrication 
methods and parts of its simulations. Oth- 
ers complain the wire is still not clean or 
long enough to prevent internal disorder 
in the wire from triggering spurious sig- 
nals. But even assuming the signal is from 
Majoranas, the tiny energy gap Microsoft reported
would offer limited protection in real-world
operations, says Mike Zaletel, a theorist
at UC Berkeley who previously worked
on the nanowires at Station Q. “It’s hard to 
see that they would have a very robust qubit 
with a gap that small.” 

Some quantum computing chips consist of error-prone
superconducting qubits. Snipping connections
between them can simulate the presence of Majorana
quasiparticles and distribute information across the
qubit array, thereby protecting it.

At the nearby UCSB, Young is taking a
wholly different route to Majorana quasiparticles,
one that revives the fractional
quantum Hall effect, but in graphene instead
of gallium arsenide. Graphene, an atom-thick
layer of carbon arranged in a hexagonal
array, was first discovered in 2004 when
researchers peeled off a layer from graphite 
using nothing more than Scotch tape. Young 
and colleagues have since figured out how to 
stack slivers of graphene and other materials 
into perfectly clean sandwiches, providing 
a quantum Hall playground that’s thinner 
than gallium arsenide, where electrons can 
more easily be coaxed to interact and split 
into quarter-charges. They also learned 
how to carve graphite crystals into intricate 
electrical gates, which they can place on 
top of their devices to trap and control the 
quasiparticles. 

Recently, Young and colleagues measured 
the energy gap of the state where Majoranas 
are expected to form in their graphene, 
and found it to be about 10 times larger 
than Microsoft’s gap. That should protect a 
graphene-based qubit at temperatures as 
high as 5 kelvin—relatively warm for quan- 
tum work like this, Young says. “It really fi- 
nally feels like we have an avenue to attack 
this legitimately,” he says. “I think we will be
able to make a qubit without any miracles.” 

Ali Yazdani, a physicist at Princeton Uni- 
versity working on similar graphene sand- 
wiches, uses electron microscopes to create a 
map of how the energy gap varies across the 
graphene surface, offering a useful guide for 
where robust Majoranas might congregate. 
In a paper now under review, he and his colleagues
identify multiple spots where the energy
gap is “whoppingly huge,” he says, even
larger than Young’s. The two groups have
planned experiments to implant the quarter-electron
charges and braid their accompanying
Majoranas. “It’s early days for taking this
to the next level, but there is no clear showstopper,”
Yazdani says.

Already, the graphene efforts have offered 
“a breath of fresh air” to the community, 
Alicea says. “It’s one of the most promising 
avenues that I’ve seen in a while.” Since leav- 
ing Microsoft, Zaletel has shifted his 
focus to graphene. “It’s clear that this 
is just where you should do it now,”
he says. 

But not everyone believes they 
will have enough control over the 
free-moving quasiparticles in the 
graphene system to scale up to an ar- 
ray of qubits—or that they can create 
big enough gaps to keep out intrud- 
ers. Manipulating the quarter-charge 
quasiparticles in graphene is much 
more complicated than moving the 
Majoranas at the ends of nanowires, 
Kouwenhoven says. “It’s super inter- 
esting for physics, but for a quantum 
computer I don’t see it.” 

Just across the parking lot from 
Station Q’s new office, a third kind 
of Majorana hunt is underway. 
In an unassuming black building 
branded Google AI Quantum, past 
the company rock-climbing wall 
and surfboard rack, a dozen or so 
proto-quantum computers dangle 
from workstations, hidden inside 
their chandelier-like cooling systems. 
Their chips contain arrays of doz- 
ens of qubits based on a more con- 
ventional technology: tiny loops of 
superconducting wires through 
which current oscillates between two 
electrical states. These qubits, like 
other standard approaches, are beset 
with errors, but Google researchers are hop- 
ing they can marry the Majorana’s innate er- 
ror protection to their quantum chip. 

Error correction on classical comput- 
ers works by copying one bit’s information 
to additional bits, but that’s a challenge 
in quantum computers, because the laws 
of quantum mechanics forbid quantum 
states from being “cloned.” Google’s work- 
around (also conceived by Kitaev) is to get 
the array of qubits to act like a topological 
material and to store the information non- 
locally, smearing it across many qubits so that 
it’s not accessible from any single measure- 
ment. Recently, Google engineers brought the 
Majorana connection a step closer to reality 
with the help of Cornell University theorist 
Eun-Ah Kim. 
In graduate school, Kim worked on designs
for braiding anyons in quantum Hall
systems, but she grew frustrated with the 
endless engineering hurdles. Last year, af- 
ter Google researchers presented their lat- 
est qubit array at Cornell, “I realized that 
this is the setting where my dream can 
finally come true,” Kim says. She and col- 
leagues learned to group together patches 
of qubits within Google’s array and then 
remove connections between individual 
qubits to create imperfections in the grid. 


Chris Palmstrøm uses a vacuum-sealed apparatus to construct and
clean nanowire devices that may hold Majorana quasiparticles. 


In a Nature publication in May, the team 
demonstrated that twirling those imper- 
fections shifts their properties much like 
braiding Majoranas. The company Quan- 
tinuum reported a similar experiment in a 
preprint around the same time. 

Unlike naturally emergent Majoranas, 
these simulated ones are only stable while 
errors are manually corrected. But the dis- 
tinction doesn’t matter as long as they help 
protect the qubits, says Hartmut Neven, 
founding director of Google Quantum AI. 
“The question is: What is easier to do?” he 
asks. “Trying to do it on the low-level hard- 
ware is maybe the harder route.” 

The Google team has yet to use the scheme 
to actually detect and correct errors. Still, the 
experiment demonstrates scientists’ growing
command over quantum systems, Marcus
says. “We’re beginning to master fire.”


OVER THE YEARS, accounts of Ettore 
Majorana’s disappearance have degraded into 
baseless conspiracies and fictional spin-offs 
featuring alien abductions and time-travel 
machines. But Guerra and Nadia Robotti, a 
historian of physics at the University of Ge- 
noa, think the truth is out there. They have 
uncovered new documents from govern- 
ment, religious, and familial archives that 
seem to reference Majorana’s death 
about 1 year after his disappearance. 
For example, one letter written by a 
Jesuit priest to Majorana’s brother 
in 1939 honors “the mourned Ettore 
Majorana” and “the beloved extinct.” 

Guerra and Robotti still do not 
know where or how he died, but they 
think his descendants have docu- 
ments that would shed more light. 
And 3 years ago, the researchers flew 
to UC Berkeley to visit the archive of 
Emilio Segrè, whose friendship with
Majorana soured in the years before
his disappearance. The researchers
came across a folder that, per Segrè’s
instructions, cannot be opened until 
the year 2057. Guerra hopes it will 
contain details that will resolve the 
case once and for all. 

The search for the Majorana quasi- 
particle is following a similar dy- 
namic. Many researchers are jaded, 
including Palmstrøm, who says the
field’s spotty history has eroded his 
optimism. Even Kitaev, the pioneer, 
who now works at Caltech and con- 
sults for Google, has grown cynical. 
“Theoretically, it’s just beautiful, but 
in practice it seems that we don’t 
have a suitable technology to make it 
work well,” he says. 

And yet, many in the game 
can’t seem to give up the hunt. 
Palmstrøm continues to meticulously prepare
semiconductor nanowires for study, baking 
them in a foil-wrapped vacuum to expel un- 
wanted gas molecules, scrubbing them with a 
pressure hose of atomic hydrogen to remove 
every last contaminating atom, and coating 
them with a thin superconducting jacket. 

Like most Majorana researchers,
Palmstrøm does not expect a single triumphant
moment of discovery. But he’s hooked
on the hope that iterations of increasingly
precise experiments will gradually bring
the long-sought particle into focus.

“We’re driven to find the truth,”
Palmstrøm says. “I think we will.”


Zack Savitsky is a science journalist in Florida who 
covers physics and astronomy. 
50 YEARS OF THE ENDANGERED SPECIES ACT

Genomics-based detection of inbreeding
among southern resident killer whales can help 
explain the population’s lack of recovery. 


A landmark environmental law looks ahead 


In late December 1973, the United States enacted what some would come to call “the pitbull of environmental
laws.” In the 50 years since, the formidable regulatory teeth of the Endangered Species Act (ESA) have been 
credited with considerable successes, obliging agencies to draw upon the best available science to protect 
species and habitats. Yet human pressures continue to push the planet toward extinctions on a massive scale. 
With that prospect looming, and with scientific understanding ever changing, Science invited experts to 
discuss how the ESA has evolved and what its future might hold. —Brad Wible 


Drawing from beyond the life sciences


By Robert L. Fischman¹ and J. B. Ruhl²


The ESA-implementing agencies, required by the ESA to follow the 
“best scientific and commercial data,” enjoy respect for and build on
long traditions of natural science expertise. Less often recognized is 
that many of the ESA’s success stories drew mainly from the social 
sciences. These offer lessons for how to improve the effectiveness of efforts
under the ESA.

Congress imposed prohibitions in the ESA that can create perverse 
incentives for landowners who face costs associated with the pres- 
ence of protected species. For instance, because a certain minimum 
tree size is required for nesting by endangered red-cockaded woodpeckers,
timberland owners were more likely to harvest trees before
they reached that size and ESA protections were triggered (1). To 
reduce preemptive habitat suppression, in the 1990s the US Fish and 
Wildlife Service (USFWS) created a safe harbor program that encour- 
ages land managers to voluntarily provide habitat for species—even 
if temporary (e.g., trees big enough to host owl nests just until they 
grow to ideal harvest size)—in exchange for immunity from regu- 
latory enforcement. This realigned incentives to generate greater 
conservation from private property (2). 

Elinor Ostrom and others have documented the effectiveness of 
collaborative governance, typified by local, bottom-up, self-enforcing 
management approaches. The threat of ESA “harm” liability from 
incidentally injuring a protected animal can prompt large-scale 
multi-landowner projects that come closer in footprint to the range 
of a species than could any single property boundary. Rather than 
navigate ESA compliance property by property, landowners may 
devise a regional plan that will fulfill the statutory conservation mandate
if they receive immunity for activities consistent with the plan.
In this way ESA-implementing agencies created the conditions to 
spur collaborative conservation efforts pooling economic benefits by 
leveraging local knowledge, habitat ownership, and less burdensome 
compliance measures (3). 

Some ESA agency rules incorporate public-private plans or best- 
management practices that transform the ESA’s harm prohibition— 
focused on difficult-to-detect consequences such as injury—into 
programs promoting best practices adapted to particular circum- 
stances in specific places (4). For instance, it would be impractical to 
detect harm from agricultural activities, such as plowing, to Washington’s
Mazama pocket gophers nestled in their burrows. But a tailored
rule shields from liability any “accepted agricultural or horticultural 
(farming) practices” if soil disturbance does not penetrate deeper 
than a foot. That provides a clear standard for both farmers and 
regulators to track and allows agricultural activities to coexist with 
species recovery. 

Conservation treatments in wildlife management commonly 
incorporate adaptive management to fine-tune (or abandon) plans 
to recover or sustain wildlife, iterating to reduce uncertainty through 
learning by doing (5). Yet adaptive management is hardly used to im- 
prove the programs and rules themselves that spur and govern adap- 
tive conservation plans. That risks innovation drift if effectiveness is 
never precisely defined, measured, and used to trigger reevaluation. 
Agencies have inadequately applied science to understand what makes
innovations work and to fine-tune agency behaviors. Sometimes 
conservation science agencies need to turn their observational tools 
on themselves. 

Going forward, an important research priority for the ESA is 
to understand better how to prompt human behavioral changes. 
Mere statutory prohibitions are not enough. The agencies should 
employ lessons from empirically tested models outside of conser- 
vation biology, genetics, and other life sciences. Like all envi- 
ronmental law, the ESA is—first and foremost—law governing 
humans, not the forces of nature. 


Updating practices 
for the genomic era 


By Brenna R. Forester³, Tanya M. Lama⁴, Marty Kardos⁵


Genetic data have been used for decades in ESA decision-making, 
most commonly for taxonomy and the delineation of subspecific 
units. Technological advances have made much larger genomic data- 
sets available for at-risk species, improving the precision and resolu- 
tion of metrics such as genetic diversity, while bringing previously 
inaccessible parameters like adaptive differentiation and individual 
inbreeding within reach. Although inferences from genomics can 
present challenges to established ESA practices, they also provide 
opportunities for innovation (6). 

Genomic data advance ESA implementation in three areas: identi- 
fication of listable units (i.e., species, subspecies, and distinct popula- 
tion segments), assessments of viability, and development of recovery 
strategies. Adaptive genomics, which improves our understanding of 
adaptive differentiation and evolutionary potential, is already being 
used to inform these three objectives (7). For example, these data can 
help describe “significance” in the designation of distinct popu- 
lation segments, by characterizing a species’ adaptive diversity 
and evolutionary legacy. In a recent case, a small genomic region 
was found to be associated with the seasonal timing of spawning 
migration runs, an important life-history trait in Pacific salmon. 
This discovery challenged established approaches for defining 
significance in salmonid conservation units, suggesting the need
for finer-scale delineations to conserve fish with the early-run
life history. However, lack of reproductive isolation and genome-wide
similarity among fish with different phenotypes resulted in
a recommendation to retain existing larger-scale units. Additional 
research identifying the dominance patterns and evolutionary 
history of this genomic region underscored the importance of es- 
tablished guidelines to conserve phenotypic diversity within units, 
ensuring that recovery actions prioritize retention of early-run 
alleles to prevent irreversible loss of the phenotype (8). 

Viability assessments and recovery efforts are also being im- 
proved by advances in the detection of inbreeding (mating between 
close relatives). Small, reproductively isolated populations are 
particularly susceptible to reduced fitness due to inbreeding (i.e., 
inbreeding depression). Runs of homozygosity (ROH), large con- 
tinuously homozygous regions in the genomes of inbred individu- 
als, can now be identified through genomic analysis, and provide 
greater power to detect inbreeding depression than traditional, 
pregenomic approaches. For example, endangered southern resi- 
dent killer whales having more ROH were found to have reduced 
survival, with population models suggesting that this inbreeding 
depression has limited population growth. These findings help 
explain the population’s lack of recovery despite efforts to reduce 
extrinsic environmental threats (9). Genomic analysis of inbreed- 
ing can also be useful in cases where detailed demographic data 
are unavailable, for example, to identify declining populations with 
particularly high inbreeding that might benefit from recovery ac- 
tions such as genetic rescue. 

Genome-scale data will continue to be scarce for at-risk species 
given the magnitude of the biodiversity crisis. It is therefore criti- 
cal that genomic inferences are evaluated in the context of popula- 
tion genetic theory, to allow more confident extrapolation to cases 
where data are lacking (10). For example, studies that link genomic 
variation to fitness can inform the development and validation of 
more widely and rapidly applicable proxies of genetic health and 
viability in at-risk species (11). Finally, we caution against reliance
on advanced interventions such as cloning and gene editing, which 
are unlikely to be broadly applied to listed species. Across the data- 
availability spectrum, ensuring the viability of species before they 
become critically imperiled is best supported by time-tested con- 
servation biology principles: maintaining intact habitats sufficient 
for large, connected populations across species’ ranges to ensure 
the integrity of ecological and evolutionary processes. 


Fostering international 
conservation 


By Grethel Aguilar Rojas⁶ and Nicholas A. Robinson⁷


The ESA nationally, and the Convention on International Trade in 
Endangered Species (CITES) worldwide, protect species at risk of 
extinction. Created together, these laws are entwined and symbiotic. 
Sixty years ago, Congress sought to update the Lacey Act of 1900, 
which criminalizes trade in wildlife taken in violation of state or 
foreign laws. Meanwhile the International Union for Conservation of 
Nature (IUCN) was proposing a treaty to curb trade of species listed 
on IUCN’s Red List of Threatened Species. In 1972, IUCN’s experts 
testified in Congress to support enactment of an Endangered Species 
Act. The US Department of Interior supported IUCN’s proposals for 
a new treaty. The US Department of State led negotiations of the 
“Washington Convention on International Trade in Endangered Species
of Wild Fauna and Flora,” which nations signed on 3 March 1973.
On 19 December 1973, the US House of Representatives and Senate
both approved the ESA (12).

Under the ESA, the USFWS lists species as endangered or threat- 
ened regardless of the country in which the species lives. ESA fosters 
scientific collaboration worldwide to gather data for both ESA’s list- 
ings and listings under CITES. ESA thus fosters international conser- 
vation of habitats and species. Governments know more is needed. 
When the United Nations Decade on Ecosystem Restoration sunsets 
in 2030, governments aim to have protected 30% of Earth’s ecosys- 
tems. In September 2023, 83 nations signed a treaty to conserve 
marine biodiversity on the high seas. The 2022 Kunming-Montreal 
Framework of the Convention on Biological Diversity (CBD) sets 
targets for national action through 2030. In 2015, the US joined 193 
nations at the UN endorsing Sustainable Development Goal 15 to 
halt biodiversity loss by 2030. 

This path toward 2030 began in 1973. The ESA established the fed- 
eral Management and Scientific Authorities that implement CITES. 
When all 184 nations in CITES confer triennially, they expose anew the
sixth mass extinction of species. Acknowledging that more than the 
ESA-CITES approach is needed, IUCN stimulated the negotiation of 
further treaties, such as the 1979 Convention on Migratory Species 
(12). Expanding on the ESA’s ethical premises, IUCN proposed a 
normative basis for further action, which the UN General Assembly 
adopted as the 1982 “World Charter for Nature” (12). This Charter 
prescribes principles for protecting genetic viability, ecosystems, and 
population levels sufficient for survival of species. On the basis of 
these principles, in 1990 IUCN again proposed a new treaty, which 
the UN launched as the CBD at the 1992 “Earth Summit” in Rio de 
Janeiro (12). Ironically, controversies about the spotted owl, listed as
threatened under the ESA, blocked US plans to sign the CBD in Rio. 
In 1993, the US did sign the CBD, but Congress has yet to ratify it. 

The plight of biodiversity is grave. Success in 2030 and beyond will 
depend upon rekindling the spirit of 1973. 


Assisted migration—moving 
species by translocation 


By Patrick D. Shirey⁸ and Gary A. Lamberti⁹


Assisted migration is the intentional translocation of species 
outside of their recent range to mitigate environmental change, 
which could include climate change, impaired watersheds, or al- 
tered land use (13). Perhaps the most famous translocation within 
the United States is the snail darter (delisted in 2022)—which was 
moved outside of its historic range in the 1970s to mitigate Tellico 
Dam construction in the darter’s habitat. Other examples include 
the Virginia roundleaf birch (agencies propagated and distributed 
seedlings to botanical gardens after the population dwindled to 
40 trees), and the Tennessee coneflower (nurseries and botani- 
cal gardens helped propagate the species to reduce risk to wild 
populations and supplement the wild populations). Though the 
basic science and logic behind translocation are straightforward, 
the scientific evidence to suggest whether it overall does more 
good than harm, and if good, then how best to do it, has been 
unsettled. This challenge has led to calls for controlled experi- 
ments with adequate planning and monitoring to be included in 
management plans for imperiled species (14).

In the Endangered Species Act of 1973, Congress did not restrict 
translocation as a management tool. However, in 1984, at a time 
when the Reagan administration was concerned about imperiled 
species potentially restricting private property development, the 
USFWS placed a restriction on translocations such that experimental
populations could not be introduced outside of a probable
historic range unless the primary habitat of the species has been 
unsuitably and irreversibly altered or destroyed [49 FR 33893, 50 
CFR §17.81(a) 1984] (15). One management challenge under this
restriction is that the regulations did not define unsuitable and 
irreversible habitat destruction. However, this regulatory restric- 
tion only applied to plant species occurring on federal or state 
land; privately owned plants did not have such restrictions against 
movement unless being sold in interstate commerce, opening the 
door for citizen-initiated assisted migration of listed, imperiled 
plant species (16). 

In 2010, we suggested that the USFWS revisit the experimental 
population restrictions because assisted migration could be a viable
management option, albeit with risks (15). In 2023, the USFWS
changed the regulation to eliminate the historic range restriction 
(88 FR 42642), noting that establishing populations outside of a 
historical range is necessary to avoid extinction for species such 
as the Florida Key deer threatened by climate change, and the 
Guam rail and Guam kingfisher (sihek) threatened by the invasive 
brown tree snake. Under the 2023 rule, the agencies must demon- 
strate that the experimental population will further conservation 
of the species, and must monitor possible adverse effects to the 
ecosystem that may result from the experimental population being 
established outside of its historical range. 

Though the experimental population regulatory changes were 
sensible measures to improve tools available to agency biologists, 
the risks of assisted migration warrant a precautionary approach 
that requires detailed planning prior to coordinated assisted 
migration of a species. This detailed planning includes updating 
older recovery plans (17) and addressing the problem of chronically 
underfunded species recovery efforts (18). 


Harnessing economics for 
effective implementation 


By Amy W. Ando


Though the ESA precludes the use of economic analysis in making 
listing decisions, insights and tools from economics have helped to 
make management and policy related to the ESA more successful 
and trigger sweeping changes in many human behaviors includ- 
ing logging, development, and water use. For example, economics 
research has informed efforts to reduce perverse habitat destruc- 
tion incentives created by the original ESA and helped to quantify 
the impacts, costs, and benefits of ESA protections (19). Because 
natural resource economics has powerful tools (both analytical 
theory and numerical optimization) for optimizing policy and 
management in the face of trade-offs, uncertainty, and human 
behavior, this discipline can contribute yet more to biodiversity 
conservation under the ESA in a world where habitats are com- 
plex and changing. 

Photo: Whooping cranes in South Dakota take off during spring migration. Economics research can inform strategies to ensure availability of habitat during migration. 

Economists are working with ecologists and scholars of water 
management on strategies to help species in complex aquatic habitats
(20). Joint management of water resources and aquatic species 
can help to minimize the cost of protecting species; for example, 
when planning dam removal to improve salmon habitat and migration,
considering hydropower benefits can help decision-makers 
choose sites for removal that minimize the social costs. And efforts 
to regulate water use to protect in-stream flows for species need 
to be careful not to regulate only one type of water use when users
can shift to deplete other sources instead. Also, bioeconomic 
research can inform strategies to save migratory species (21). New 
policies such as “pop-up” habitat modification (like flooding of 
rice fields) or permeability improvement (like taking down fences) 
during times that migratory species are passing through can draw 
upon economics to optimize the timing, location, and extent of 
temporary actions to maximize their net benefits to society. Econo- 
mists can also help to clarify how the net benefits of a migratory 
species vary over its range, helping to set the stage for regional 
negotiations that ensure a species’ survival is in the best interests 
of people throughout its entire range. Conservation plans for all 
species must account for habitat shifts that are happening because 
of climate change; a conservation portfolio approach to reserve 
site planning can efficiently help to ensure that species have sup- 
portive environments in an uncertain future (22). 

Economics can also help move beyond emergency measures to 
save species that are on the brink of extinction. Ongoing research 
can help ESA policy-makers and managers to enhance efforts 
and institutions that encourage private land owners to engage 
in preemptive conservation that avoids the need for endangered 
species listing at all (23). For example, successful preemptive con- 
servation is more likely to occur if action is taken to help coor- 
dinate multiple private landowners to avoid free-rider problems 
and to prompt conservation before the species is so endangered 
that success of preemption is costly and unlikely. Economics 
can inform strategic thinking about management strategies and 
policies once a species has been recovered enough to be taken 
off the list (24). For example, differentiating population require- 
ments for a species across the states in its range can reduce the 
ongoing costs of supporting the species’ survival, and ancillary 
policies such as compensation for direct and indirect damages 
from a species like the gray wolf can reduce private incentives to 
eradicate them. 


Pushing boundaries 
with new interventions 


By Stephen Palumbi and Michael Wara


Protecting the literal foundation of tropical reef ecosystems requires 
new interventions that push the boundaries of historic implementation
of the ESA and will require careful exercise of agency discretion 
under the Act. At the same time, preservation of reef ecosystems can 
be viewed as part of a decades-long effort to shift ESA protections 
from a focus on individuals to species to whole ecosystems within 
the confines of the statute. 

Originally ESA protections for corals, as for other taxa, focused 
on the need to conserve particular species and their habitats. Rapid 
decline of coral reefs owing to climate change-driven heatwaves has 
led to global efforts in coral protection and calls to restore reefs by 
growing corals in coastal nurseries. Yet these efforts rarely result 
in fully restored reefs, and many nursery-grown corals succumb to 
rising temperatures. A new strategy is to restore reefs with naturally 
occurring corals that exhibit resistance to heat damage (25). Breed- 
ing corals in aquaculture facilities for higher heat resistance, and 
hybridizing them with more heat-resistant species, are also under- 
way. In addition, engineering corals by changing their symbionts or 
microbiome or through gene editing has been proposed and proof-of- 
concept research conducted. 

Two coral species are fully protected under endangered species 
rules, and more are listed as threatened. How might coral interven- 
tions affect the protection of corals under the ESA (26)? Protecting 
and increasing species numbers in lab or zoo settings is in line with 
the ESA, as is adding individuals from different natural populations 
to enhance population diversity or adaptation (26). Adding new spe- 
cies may be more of a gray zone because they may supplant native 
corals under ESA protection, and thus may constitute a “take” under 
ESA regulation and require interagency consultation and permitting. 

Breeding corals for higher heat tolerance in the lab through 
artificial selection may be similar to efforts to breed black-footed 
ferrets for disease resistance (27), a project underway with USFWS 
approval (though release of these animals has not happened yet). 
The van Oppen team in Australia took internal symbionts from 
corals into the lab (28) and evolved them over 10 years to be 
more resistant to high heat. They stripped corals of their regular 
symbionts and substituted the lab-evolved strains. A few colonies 
successfully integrated the lab-evolved symbionts and grew well 
under higher-temperature conditions. ESA protections currently 
regulate “infecting” endangered corals with manipulated symbi- 
onts, as well as placing them back out into the field (26). 

Probably the most technically challenging of the possible 
interventions involves genetically engineering corals. Cleves and 
colleagues (29) took a CRISPR injection rig to Australia for the 
few nights of coral spawning each year and injected thousands of 
eggs. Yet, such corals are not currently permitted to be released 
into the wild, and at this point there is no clear understanding 
of the genes that need to be altered to generate climate-resistant 
corals. In this case, if genes in the wild are found that confer 
climate protection, they can be used under the ESA. But creating 
those genes—even if it were possible—appears to be outside cur- 
rent ESA guidelines. 

Both the environmental harm to corals and new approaches 
to saving coral reefs are advancing at an accelerating pace. For 
most of these interventions, permits for research or enhance- 
ment, incidental take, or interagency consultation mechanisms 
are built into the ESA system and can and should allow explora- 
tion of these new possibilities. An important opportunity for fa- 
cilitating new conservation approaches could be the development 
of recovery plans for 15 Indo-Pacific coral species (30). 

Many mechanisms within the ESA balance between protection 
and enhancement of populations. For the last half-century, the 
ESA has been fundamental in protecting species as they existed 
in the past. Attention must turn to adapting its implementation 
to preserve ecosystems for the future. 


Learning to overcome barriers 
to adaptive management 


By Mark W. Schwartz and Matthew A. Williamson 


A core challenge for public agencies responsible for ESA implemen- 
tation and enforcement is to clearly demonstrate success so as to be 
recognized by society as providing legitimate, effective governance. 
To be sure, species are recovering under the ESA. But how recovery 
funding drives that recovery is unclear. The USFWS has a ranking 
system to prioritize species, but recovery funding is poorly correlated 
with priority rank. Recovery plan actions for species are ranked, but 
to little effect. In addition, although federal agencies have adopted 
Adaptive Management, little effort is allocated to monitoring outcomes
of actions (31). 

Effectiveness and legitimacy of the ESA are fostered through 
transparency and clear links between expenditures, actions, and 
outcomes. Over the past 20 years, drawing on advances in decision 
science and computational algorithms, conservation practice has cre- 
ated a variety of frameworks for planning, decision-making, spatial 
prioritization, evidence use, and outcome evaluation (32). Despite 
demonstrated successes (33), use of these frameworks to manage 
federally listed endangered species remains the exception. Ac- 
celerating the recovery of endangered species requires increas- 
ing the capacity to coordinate and efficiently invest in priority 
actions targeted to specific objectives combined with monitoring, 
learning, reporting, and adjusting future actions (31). 


If the ESA is to truly deliver on recovering endangered spe- 
cies, with strong credibility among the public, then an explicit, 
transparent rationale for resource allocation to achieve recovery 
objectives is vital. Recently revised public recovery documents
(Species Status Assessments, Recovery Plans, Recovery 
Implementation Strategy, and Implementation Action Tables) 
have improved planning and action tracking but fail to explicitly 
link actions to expected outcomes. Adaptive management is most 
effective when plans rest on a conceptual model that links ac- 
tions to recovery objectives through some theory of change (34). 

There are several barriers to strategic adaptive management 
of endangered species. First, and foremost, is that the lead ESA 
agencies, tasked with recovery planning and tracking, are minor- 
ity financial contributors to recovery actions. In fiscal year 2020, 
for example, just 8.4% of the $1.25 billion of public funding spent 
on endangered species came from the USFWS. As a consequence, 
the USFWS plans, prioritizes, and tracks actions that are almost 
entirely not its own. With just a small fraction of the funds, 
the USFWS has limited capacity to direct resources to priority 
species, a situation exacerbated by congressional earmarks for 
special interest species. Within species, the USFWS not only lacks 
the capacity to direct funds, but uses a three-tiered priority rank 
that is not driven by decision tools to optimize outcomes. 
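
As a rough check on the scale involved, the fiscal year 2020 figures quoted above can be turned into dollar amounts. The sketch below uses only the two numbers given in the text (8.4% and $1.25 billion); the function and variable names are illustrative, not part of any agency tool.

```python
# Figures quoted in the text for fiscal year 2020.
TOTAL_PUBLIC_SPENDING_USD = 1.25e9  # all public funding spent on endangered species
USFWS_SHARE = 0.084                 # fraction of that total that came from the USFWS


def split_funding(total_usd: float, agency_share: float) -> tuple[float, float]:
    """Return (dollars from the agency, dollars from all other sources)."""
    agency_usd = total_usd * agency_share
    return agency_usd, total_usd - agency_usd


usfws_usd, other_usd = split_funding(TOTAL_PUBLIC_SPENDING_USD, USFWS_SHARE)
print(f"USFWS: about ${usfws_usd / 1e6:.0f} million")
print(f"All other sources: about ${other_usd / 1e6:.0f} million")
```

On these figures the USFWS contributed roughly $105 million, with more than ten times that amount directed by other funders.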

Lacking the required financial resources to fully implement 
recovery actions, public agencies make difficult choices in funding
actions, and critical monitoring and evaluation functions 
appear to be frequently left out. Inadequate monitoring and evaluation 
not only undermine the ability of agencies to justify their actions 
to the public, but also forgo efficiencies to be gained through 
learning and improving. Better cross-agency coordination and 
collaboration are needed to make optimal use of limited resources 
and implement adaptive management. Interventions on behalf of 
endangered species are experiments from which we must learn (35). 


Sustainable, trustworthy, 
human-technology partnership 


By Tanya Berger-Wolf, Sara Beery, David Rolnick, Justin 
Kitzes, David Thau, Devis Tuia, Daniel Rubenstein


Despite conservation successes, we are in the middle of a mass 
extinction without even knowing all that we are losing and how fast. 
To address the urgency and scale of these challenges, there has been 
an explosion of technology developed to collect data on biodiversity, 
and parallel advances of computational methods in data analysis, 
machine learning (ML), artificial intelligence (AI), and cyberinfra- 
structure. The goal is to fill the data gap and turn raw data into high- 
resolution information about living organisms, enabling scientific 
inquiry, conservation, and policy decisions (36). 

Photo: A field biologist adds scent to the ground to encourage animals to stop in front of a trail camera, a technology increasingly used to survey animal species. 

For example, the National Oceanic and Atmospheric Administration
(NOAA) Advanced Sampling and Technology for Extinction Risk 
Reduction and Recovery (ASTER3) (37) program for fisheries’ conser- 
vation uses aerial and submersible drones, acoustic sensors, satellite- 
and drone-based imaging, -omics, and AI/ML. The USFWS uses ML 
to track and count bird populations from aerial images, while the 
Bureau of Ocean Energy Management tracks and counts whales 
using algorithms to identify individuals in photographs (38). States 
use motion-sensitive cameras and an ML image object detector (39) 
to survey animal species, and smartphone-enabled citizen scientists
count insects, identify bird songs, and report millions of plant 
observations (40). Algorithms developed for deploying US Federal 
Air Marshals are now used to plan park rangers’ routes and deter 
wildlife criminals (41). Techniques for planning robot paths are leveraged
to prioritize areas for biodiversity conservation (42). 

The technological shift has the potential to enable a more ef- 
fective, affordable, highly automated, globally distributed, locally 
relevant, real-time biodiversity monitoring system, improving 
equity across taxa as well as geographic regions (43). Technology 
can change the scale of conservation efforts by expanding our un- 
derstanding of habitats and communities and predicting how they 
might change under different protection plans. 

Yet, technology comes with risks. The collected data are biased, 
missing large areas and many taxonomic groups. Lack of robust 
collaborations between conservation biologists and computational 
and AI experts slows the development of computational tools. Bi- 
ased data and methods will distort evaluation and discovery. If not 
carefully used, relying on technology for nature observation can 
also distance humans from nature, severing an important personal 
connection needed to inspire the next generation of scientists and 
nature lovers. 

Technology is expensive, and conservation is already under- 
funded, leading to inequities in access to tools, data, and compu- 
tational resources, as well as the expertise needed to use them. 
Technology pulls funding away from other needs, not always with 
commensurate impact, with the focus too often on using technol- 
ogy for its own sake. Conservation needs buy-in and support from 
local communities who are directly affected by nature loss. However, 
technology can undermine community or individual data rights or 
be deployed in languages or with cultural norms not relevant locally. 
But when local communities are given agency in every part of the 
process, trusting and productive partnerships can be developed. 

The same technologies that accelerate impact can also accelerate 
environmental and conservation risks. Computation can be energy 
and water intensive, and computing hardware uses rare metals 
and produces e-waste. Additionally, AI-enabled data-gathering 
tools may be used by nature criminals, and are already used exten- 
sively to aid oil and gas extraction. 

To be useful, technology needs to be a sustainable and trust- 
worthy partner. Ultimately, technology alone, even AI, will 
not save the planet’s species. But neither will humans alone. 
Human and machine partnership for conservation is our best 
chance for success. 


Adding tribal experience 
and removing inequity 


By Caleb R. Hickman and Julie Thorstenson


Over millennia, tribes have learned to coexist with species on the 
American continent. Despite centuries of conflict with settler- 
colonial Americans, federally recognized tribes influence the 
management of nearly 140 million acres in the US. These lands 
feature diverse legal classifications offering protective status with 
regulatory burdens specific to tribes (44). Studies underscore the 
pivotal role of Indigenous stewardship, revealing higher biodiver- 
sity levels and more borders with protected areas compared to 
adjacent state lands (44). Indigenous-managed lands constitute 
only 2.6% of the US but overlap with 12% of Key Biodiversity 
Areas, underscoring their ecological importance (44). 

Photo: The Shoshone-Bannock Tribes have worked to protect and restore the Snake River and endangered sockeye salmon. 

Despite their crucial conservation role, tribes face considerable
funding disparities compared to states. Tribes miss out on 
the annual federal aid of more than $1 billion allocated to states 
for conservation under the Wildlife Restoration Act (Pittman-Robertson)
and the Sport Fish Restoration Act (Dingell-Johnson). 
In addition, the State Wildlife Grant (SWG) program, distributed 
on the basis of species and land area (which often lumps in tribal 
lands), allocated $1.2 billion over 20 years for nongame species
conservation to every state. Tribes do not receive the SWG 
funds, but a similar fund for tribes is the extremely limited Tribal 
Wildlife Grant (TWG) program. The TWG program, funded with 
competitive awards and capped at $200,000 per project, has 
disbursed $105.6 million since 2002, yet only 25% of applica- 
tions received funding (44). This results in capacity challenges, 
whereby tribes must juggle all taxa management (among other 
natural resources) while state agencies can hire specialized 
biological staff who focus on single taxon groups and single 
species. Insufficient funds result in reduced conservation ef- 
forts, management sovereignty, and tribal lifeways. 

Tribally held priorities can be overlooked when applying the 
ESA. We recommend applying Indigenous perspectives to the 
ESA as a way to avoid neocolonial American practices. Lamb et 
al. (45) recommend integrating food security and cultural rela- 
tionships into species recovery plans. We see a need to revital- 
ize traditional coexistence philosophies (46) while avoiding the 
“Ecological Indian” fallacy that overromanticizes the role of 
Indigenous people as historical stewards rather than modern 
practitioners (47). Without Indigenous coexistence philoso- 
phy in modern management, neocolonial influences may lead 
tribes to embrace market trade and industrial capitalism (3). 
Despite inquiries about the “in vogue” traditional ecological 
knowledge, tribes’ philosophies often remain overlooked with- 
out direct federal oversight, which is an all too often paternal- 
istic approach (48). 

To achieve true species recovery for all, a shift in conserva- 
tion philosophy is essential (46). Tribes, as sovereign nations, 
face a disproportionate burden compared to US citizen states, 
which is antithetical to Secretarial Order 3206 and its emphasis on
cultural deference. A recent inclusive step occurred when 
President Biden’s administration supported a memorandum on 
Tribal Consultation, which can include species conservation. 

By recognizing the distinct challenges that tribes face, the 
scientific community can also play a vital role in reframing the 
ESA into a law for everyone. This equity can be realized when 
a coproduced form of conservation includes tribal knowledge 
systems and priorities from the beginning and shares re- 
sources throughout the process (48). 


C.R.H. is a citizen of the Cherokee Nation. J.T. is Lakota and a citizen 
of the Cheyenne River Sioux Nation. 


1354 22 DECEMBER 2023 + VOL 382 ISSUE 6677 


Genome editing 
and deliberate extinction 


By Gregory E. Kaebnick, James P. Collins, Athmeya Jayaram


Advancing genome editing technologies, prominent among them 
synthetic gene drive systems, may lead to methods for suppress- 
ing or locally exterminating some species, or even driving them 
extinct. How that prospect accords with the ESA is an emerging 
policy issue with potentially profound ramifications for environ- 
mental, public health, and agricultural policy. 

Effects of a gene drive system would depend on the design of 
the system, e.g., alleles targeted for change, diversity of a species’ 
gene pool, and a species’ population structure (49). For example, 
a drive that reduced female fertility in malaria-transmitting 
mosquitoes Anopheles gambiae could lead to substantial popula- 
tion decline on a regional scale, though modeling suggests that 
complete extinction is unlikely (50). Gene drive systems are also 
in development to eliminate invasive rodent populations that 
threaten other species on oceanic islands (51). Other candidate 
species include the New World screwworm, a blow fly causing 
considerable damage to livestock and posing a threat to humans, 
for which genes required for female development or fertility have 
been identified and could be targeted by gene drive (52). 

Eliminating disease-carrying mosquitoes and screwworm 
appears to be permissible under the ESA, which exempts insect 
pest species that present “an overwhelming and overriding risk” 
to humans. The law’s applicability to a widespread population 
decline of invasive rodent populations due to a gene-drive system 
is more ambiguous. ESA protections apply only after a species is 
listed as endangered, which typically requires evidence that the 
species has already declined. The ESA, therefore, may not apply 
to the prospective threat of a gene drive (53). Additionally, how 
genome editing would be considered in the listing process is not 
settled. The factors that trigger listing have generally been exter- 
nal threats to a species such as hunting and habitat change, not 
genetic alterations integrated into a species’ gene pool. 

But if the letter of the law needs clarification, the spirit of the 
ESA clearly places an extremely high value on species and rules 
out eradication in most cases. The exception made for insect pests 
shows, however, that some goals, such as preventing the enormous 
public health harms associated with some insects, might override 
that high value. Exactly which harms are overriding—and whether 
they are posed only by insects—are important questions. But, 
plainly, if the ESA is taken to heart, genetic interventions that could 
lead to a species’ extinction should be evaluated very conservatively 
and would be acceptable only rarely. 


Regulating trade toward global 
sustainable development 


By Thomas Deleuil and Ying Zhao


A group of scientists and environmental managers in 1963 called 
for “an international convention on regulation of export, transit 
and import of rare or threatened wildlife species or their skins and 
trophies” as enshrined in a resolution of the IUCN (54). In 1973, 
based on a recommendation of the 1972 Stockholm Conference on 
the Human Environment, the United States hosted a Plenipotentiary
Conference in Washington, DC, where the Convention on International
Trade in Endangered Species of Wild Fauna and Flora (CITES) was opened 
for signature. The US was the first signatory country to ratify the 
Convention, which came into force in 1975 (55). 

CITES is an international legally binding agreement regulat- 
ing billions of dollars of international trade in specimens of wild 
animals and plants. The two cardinal rules are that trade shall 
not be detrimental to the survival of the species and traded 
specimens must have been acquired legally (56). 

Parties are required to adopt domestic legislation to ensure 
full implementation of CITES at the national level. In the US, 
the ESA is the domestic legislation for CITES. The USFWS acts 
as both the Management Authority and Scientific Authority for 
CITES, tasked with providing scientific advice, verifying the 
legality of specimens traded, issuing CITES permits and certifi- 
cates, enforcing pertinent laws, and submitting trade reports, 
for example (57). 

The ESA not only encompasses US international obligations 
under CITES but also imposes, in some cases, stricter domestic 
measures above CITES standards, including the protection of 
species that are not covered by the Convention. For instance, the 
ESA mandates the development of recovery plans for the conser- 
vation and survival of listed species when deemed necessary 
(56). Thus, the ESA also demonstrates the commitment of the 
US to the conservation of species. 

Today, CITES has 184 signatory Parties and regulates trade 
in over 40,900 species. It is one of the most successful interna- 
tional environmental treaties concerned with nature conserva- 
tion. Notably, no species listed under CITES has become extinct, 
and the US, through the ESA, has played its part in this global 
effort. However, since the adoption of the Convention in 1973, 
societies have evolved and environmental threats have multiplied.
Through the ESA, the USFWS Office of Law Enforcement has 
also played and continues to play an important role in combat- 
ing illegal wildlife trade, including online trade (58). 

The pressure to regulate trade in and conservation of wild 
species is growing. In an increasingly complex world, as both 
CITES and the ESA mark their 50th anniversaries, they are more 
relevant than ever to ensure that wild species are conserved 
for the benefit of people, planet, and prosperity to achieve the 
United Nations Sustainable Development Goals. 


Genetic insight on ice sheet history 


Octopus DNA reveals timing of the most recent collapse of the West Antarctic Ice Sheet 


By A. Dutton and R. M. DeConto


At the South Pole of Earth sits a landmass
that has been home to a continent-sized
ice sheet for the past 34 
million years. Scientists have long 
been drawn to this icy expanse, initially
to explore and discover, and 
later to study the evolution of the Antarctic 
continent, climate, and ice sheet. Despite 
decades of probing the region from the 
surface and from space, some important 
aspects of Antarctic history remain elu- 
sive. On page 1384 of this issue, Lau et al. 
(1) make headway to fill one 
such knowledge gap by re- 
porting genetic evidence that 
two distinct populations of 
octopus were connected by a 
waterway across an area now 
completely covered by the 
ice sheet. This suggests that 
the West Antarctic Ice Sheet 
(WAIS) collapsed during a 
past warm period ~129,000 to 
116,000 years ago, known as 
the Last Interglacial. 

More than 50 years ago, 
glaciologist John Mercer was 
the first to deduce from sedi- 
mentary evidence that the 
WAIS melted away at some 
point during the Pleistocene 
(2.58 million to 11,700 years 
ago) (2). By leveraging inde- 
pendent evidence for sea lev- 
els that were several meters 
higher than present day during the Last 
Interglacial, Mercer hypothesized that 
the WAIS disintegrated during that warm 
period. Presciently, Mercer proposed that 
if that had indeed happened, then within 
50 years, the disintegration of the WAIS 
would be imminent, leading to a globally 
averaged sea-level rise of about 5 m (3). 

Although global mean sea surface tem- 
perature during the Last Interglacial was 
roughly the same as today (4), the Antarctic 
warmed to temperatures that were several 
degrees higher than present day (5). Future 
warming scenarios suggest that WAIS col- 
lapse could be triggered within several de- 
cades (6), if it has not already begun (7). 
This would bring about an estimated aver- 
age 4 to 5 m of global sea-level rise (8), al- 
though the seas would rise higher in some 
places (such as the US east coast) relative 
to others because of changes in the gravity 
field associated with ice loss (9). Knowing 
the conditions under which the WAIS col- 
lapsed during the Last Interglacial should 
help pinpoint how and when such an event 
will unfold again under anthropogenically 
driven future warming, with enormous 
policy implications for global mitigation 
and adaptation efforts. 

Figure: Ice sheet and octopus geography 
Octopuses in the Amundsen, Ross, and Weddell Seas are isolated by today’s ice 
sheet (left). If the vulnerable, marine-based sectors of the ice sheet retreated 
during the Last Interglacial (right), the resulting seaways would have allowed ancient 
connections between octopus populations compatible with genetic data. 

In the decades that followed Mercer’s 
work, scientists set out to test his hypoth- 
esis in various ways. Some focused on 
looking for evidence within the Antarctic 
ice sheet itself (10) or within sediments
offshore (11). Others searched further afield 
to combine estimates of total sea-level rise 
with existing estimates of Greenland melt 
volume to calculate the Antarctic contribution
(by subtraction) (12). Modelers 
tried to discern how the Antarctic ice 
sheet would have responded to the Last 
Interglacial climate conditions (6). Using 
this multipronged approach, firmly cen- 
tered on the geologic history, evidence has 
been mounting that the WAIS may have 
collapsed during the Last Interglacial. 
However, each study’s findings have come 
with caveats. The general conclusion from 
much of this work is that the findings are 
largely circumstantial, with multiple pos- 
sible interpretations. 
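
The "by subtraction" approach mentioned above can be sketched as a simple budget: if total sea-level rise and the Greenland contribution are each known only as a range, the Antarctic residual is also a range, and a wide one. The numbers below are hypothetical placeholders chosen for illustration, not values from the cited studies, and the `Range` and `subtract` names are likewise assumptions of this sketch.

```python
from typing import NamedTuple


class Range(NamedTuple):
    """A low/high bound in metres of global mean sea-level rise."""
    low: float
    high: float


def subtract(total: Range, *others: Range) -> Range:
    """Interval subtraction: residual = total minus the sum of the other terms.

    The residual is smallest when the total is at its low bound and the
    other contributions are at their high bounds, and vice versa.
    """
    low = total.low - sum(o.high for o in others)
    high = total.high - sum(o.low for o in others)
    return Range(low, high)


# HYPOTHETICAL placeholder ranges, for illustration only.
total_rise = Range(6.0, 9.0)  # total Last Interglacial sea-level rise
greenland = Range(1.0, 3.0)   # Greenland melt contribution

antarctic = subtract(total_rise, greenland)
print(f"Antarctic residual: {antarctic.low} to {antarctic.high} m")
```

Because the uncertainties of both inputs add, the residual range is wider than either input range, which is one reason findings of this kind remain open to multiple interpretations. Other contributors, such as thermal expansion or mountain glaciers, could be passed as additional arguments to `subtract`.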

In a pioneering study, Lau et al. bring 
an entirely different dataset to bear on the 
increasingly urgent question of whether 
or not the WAIS collapsed during the Last 
Interglacial. The study brings 
molecular biologists and ge- 
ologists together to discern 
whether the genetic history of 
Turquet’s octopus (Pareledone 
turqueti) could reveal the 
timing of when a waterway 
may have formed if the WAIS 
fully retreated. Although a 
disappearance of the WAIS 
would result in a massive in- 
undation of coastal regions, it 
would simultaneously open 
up areas on the seafloor that 
are currently covered by ice, 
allowing Turquet’s octopus 
to colonize new habitats and 
mix genes with populations 
that were formerly geographically
isolated from each other. 

Lau et al. focused on the 
modern populations of P. 
turqueti in the Amundsen, 
Ross, and Weddell Seas that are situated 
in the Southern Ocean off the coast of the 
WAIS. The authors determined that these 
populations are genetically distinct but 
have some admixture, indicating a poten- 
tial historical trans-West Antarctic seaway 
(see the figure). Given the current ice sheet 
configuration, historical gene flow could 
only have occurred if the populations were
connected by an interior waterway across regions of the WAIS that are 
currently grounded below sea level. Their 
genomic data analysis detected a historical 
gene flow from the Ross Sea to the Weddell 
Sea and vice versa. Specifically, the timing 
of this connection in their demographic 
model agrees with the timing of the Last 
Interglacial, implying retreat of the
marine-based portions of the WAIS.
Whether or not this analysis withstands 
further scrutiny and the test of time, the 
implications of this result pose some in- 
triguing questions, including whether this 
history will be repeated, given Earth’s cur- 
rent temperature trajectory. 

Answering this question requires resolv- 
ing additional questions about the timing, 
nature, and conditions of past deteriora- 
tion of the WAIS. What were the physical 
conditions that primed this sector of the 
ice sheet to retreat, and precisely when did 
it happen? If the WAIS retreated early in 
the Last Interglacial as some data suggest 
(10, 12), was this event the consequence of 
changes in ocean currents, temperatures, 
and/or solid earth response that preceded 
the interglacial? If the trigger occurred 
just before the warm period, then perhaps 
the simplistic emphasis on how warm it 
got during the interglacial should not be 
a focus. 

There are also questions about how 
quickly sea level rises as the WAIS disinte- 
grates. Would it rise relatively slowly and 
gradually, drawn out over millennia, or 
would it rise in one or more rapid jumps as 
vulnerable sectors of the ice sheet retreat? 
Understanding the past nature of ice loss 
informs future sea-level rise projections, 
which are of fundamental importance for 
coastal planners. 

The problem, perhaps, runs even deeper 
than these specific, scientific questions. 
The challenge in identifying a precise 
tipping point—and all the conditions 
thereof—is that the tipping point will likely 
not be apparent until it has been passed. 
Policy-makers will always have to make 
decisions in the face of uncertainty about 
the future, and this latest piece of evidence 
from octopus DNA stacks one more card 
on an already unstable house of cards. 

SIGNALING

Deciphering downstream receptor signaling

Advancing drug discovery requires increasingly integrative
structural biology approaches


By Marta Filizola and Jonathan A. Javitch


G protein-coupled receptors (GPCRs)
are important cell-surface signaling
proteins that are responsive to diverse
extracellular stimuli and are
key drug targets (1). Understanding
how compounds activate GPCRs
and modulate their interactions with intracellular
proteins such as G proteins and
β-arrestins is crucial for drug discovery because
these proteins transduce signals to
downstream effectors, triggering biologi- 
cal responses. This includes elucidating the 
molecular details behind the ability of the 
drug-GPCR complex to generate a func- 
tional response (efficacy) and the concen- 
tration of the drug required to produce half- 
maximal response (potency) (2). Although 
agonist binding to a GPCR 
triggers conformational re- 
arrangements throughout 
the receptor and its trans- 
ducer (3), the molecular 
mechanisms that govern 
ligand efficacy and potency 
are difficult to ascertain. 
On page 1378 of this is- 
sue, Heydenreich et al. (4) 
explored how individual 
amino acids in the prototypical
Gs-coupled β2-adrenergic receptor
“interpret” information encoded in the at- 
oms of its endogenous agonist, adrenaline, 
to drive its efficacy and potency. 
Traditionally, the quest to decipher GPCR 
signaling has focused on recording agonist- 
specific functional responses, but the mo- 
lecular determinants and steps involved in 
these responses have largely remained ob- 
scure. Most pharmacological, structural, and 
mutational studies of GPCRs have focused 
on the ligand-binding pocket, the GPCR- 
transducer interface, or both. In a painstak- 
ing study spanning multiple domains of in- 
vestigation, including alanine mutagenesis, 
a bioluminescence resonance energy trans- 
fer (BRET) functional assay, analysis of 
inactive and active crystallographic struc- 
tures, computational data analysis, and 
evolutionary analysis, Heydenreich et al. 
have undertaken what they call an integra- 
tive approach to understand the GPCR com- 
munication network (that is, the allosteric 
pathway through which agonist binding is 
communicated from its binding site on the 
receptor to the GPCR-G protein interface). 
Changes on the receptor’s surface or at dis- 
tant allosteric sites can be just as impact- 
ful as those at the ligand-binding site or the 
receptor-transducer interface. Notably, the 
allosteric network of noncovalent contacts 
that they identify in the β2-adrenergic receptor,
a major drug target
for the treatment of respi- 
ratory diseases and heart 
failure, involves pharmaco- 
logically important residues 
that contribute as drivers, 
modulators, passengers, 
and bystanders to the mo- 
lecular and structural foun- 
dations of adrenaline’s po- 
tency and efficacy. 
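
Efficacy and potency, as defined earlier, are often summarized with a Hill-type concentration-response curve. The sketch below assumes that standard textbook form; the function name, the two hypothetical ligands, and all parameter values are illustrative and are not taken from Heydenreich et al.

```python
def response(conc, emax, ec50, hill=1.0):
    """Hill-type concentration-response curve (arbitrary response units).

    emax : maximal response, a stand-in for efficacy
    ec50 : concentration giving half-maximal response, a stand-in for potency
    """
    return emax * conc**hill / (ec50**hill + conc**hill)

# Hypothetical ligands, not measurements from the study.
full_agonist = {"emax": 100.0, "ec50": 1e-8}    # higher efficacy, lower potency
partial_agonist = {"emax": 40.0, "ec50": 1e-9}  # lower efficacy, higher potency

for conc in (1e-10, 1e-8, 1e-3):  # molar concentrations
    print(
        f"{conc:.0e} M  full={response(conc, **full_agonist):6.2f}  "
        f"partial={response(conc, **partial_agonist):6.2f}"
    )
```

At low concentration the more potent partial agonist gives the larger response, whereas at saturation the full agonist does, which shows why the two quantities have to be tracked separately.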

Although distinct from 
key concepts of probability and informa- 
tion theory analyses that provide insights 
into the rules that govern information and 
uncertainty (5, 6) in physics-based evalua- 
tions of the molecular dynamics of a sys- 
tem, the important residues identified by 
Heydenreich et al. can in principle be used
to guide ligand design. This could enable the 
identification of chemical groups that can 
be modified to achieve desired signaling re- 
sponses and eventually be tested iteratively 
and also explored in the context of other 
agonists. Notably, the observation that sur- 
face-exposed driver, modulator, and passen- 
ger residues identified with an endogenous 
ligand are also targeted by drugs acting 
as negative and positive allosteric modu- 
lators suggests that these sites should be 
prioritized in high-throughput virtual 
screening efforts for the discovery of new 
allosteric modulators. 
Evolutionary analyses indicate possible
overlaps in closely related systems (7). The 
study of Heydenreich et al. concludes that 
the functional role of a residue cannot be 
predicted solely on the basis of evolutionary 
conservation. It is plausible that the resi- 
dues and networks involved in the efficacy 
or potency resulting from ligand binding 
are distinct and nonoverlapping, varying 
with the specific ligand, GPCR, and/or in- 
tracellular partner subtypes. 

Notably, a data analysis based on efficacy 
and potency measurements for a single li- 
gand derived from a highly amplified as- 
say, point mutations limited to substitution 
with alanine in a single receptor, and al- 
losteric networks derived from the analy- 
sis of two static receptor conformations 
inevitably overlooks two critical aspects of 
GPCR signaling that have emerged in the
past decade. These are pluridimensional
efficacy, the ability of the GPCR to engage 
multiple G protein and non-G protein effec- 
tors; and biased agonism, which is the abil- 
ity of the GPCR to preferentially engage one 
effector over another (8). Contrary to the 
traditional view of GPCR-mediated signal 
transduction as a linear pathway mediated 
by single GPCR-G protein complexes, it is 
now evident that the inherent conforma- 
tional plasticity of the receptors is exploited 
by ligands inducing maximal stimulation 
(full agonism) or lower efficacy (partial 
agonism). These ligands stabilize receptor 
conformations that preferentially couple to 
and/or activate different G protein subtypes 
or non-G protein effectors, enabling differ- 
ent downstream signaling cascades that can 
result in either beneficial or adverse physi- 
ological effects. 

The resulting functional selectivity can 
originate not only from ligand-specific
conformational changes in the receptors 
but also in the G protein or non-G pro- 
tein partners, as well as from the varying 
kinetics of the transducer’s conformational 
dynamics and its association with and dis- 
sociation from the receptor (9). These as- 
pects of GPCR signaling necessitate high- 
resolution spatiotemporal measurements 
of the stability, dynamics, and activation 
kinetics of the GPCR-transducer complex, 
which can be conflated in amplified assays 
such as the BRET functional assays used 
by Heydenreich et al. Readouts from these 
assays are time-averaged and represent an
ensemble average of the receptor-G pro- 
tein complex, interpreted through the lens 
of a steady-state impact on BRET between 
the Gs α and γ subunits. The conclusion of
Heydenreich et al. that the pharmacological
importance of a residue cannot be inferred 
merely from the extent of its conforma- 
tional change during receptor activation, as 
determined from the analysis of only two 
high-resolution receptor-G protein com- 
plex structures, underscores that the sys- 
tem’s dynamics cannot be ignored. These 
dynamics can be crucial for characterizing 
agonist-induced allosteric pathways that 
extend beyond the receptor and receptor- 
transducer interface into the transducer 
protein. This consideration should also ac- 
count for the impact of different lipid envi- 
ronments on signaling, given the growing 
evidence that not only cholesterol, but also 
other cell membrane lipids such as anionic 
lipids, can influence both receptor activa- 
tion and G protein coupling (10, 11). 
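
The point that time-averaged readouts can conflate stability and kinetics can be illustrated with a generic one-site binding model. This sketch is not the BRET assay used in the study; the model, the function name, and the rate constants are assumptions chosen only to show two complexes with the same steady state but very different time courses.

```python
import math

def occupancy(t, kon, koff, ligand):
    """Fraction of receptor in complex at time t for simple one-site binding.

    Solves dB/dt = kon*ligand*(1 - B) - koff*B with B(0) = 0.
    """
    k_obs = kon * ligand + koff   # observed relaxation rate (1/s)
    b_eq = kon * ligand / k_obs   # steady-state occupancy
    return b_eq * (1.0 - math.exp(-k_obs * t))

# Hypothetical complexes with the same Kd (koff/kon = 1e-8 M) but
# rates that differ 100-fold; the values are illustrative only.
fast = {"kon": 1e7, "koff": 1e-1, "ligand": 1e-8}
slow = {"kon": 1e5, "koff": 1e-3, "ligand": 1e-8}

for t in (10.0, 1e5):  # seconds
    print(f"t={t:g} s  fast={occupancy(t, **fast):.3f}  slow={occupancy(t, **slow):.3f}")
```

A readout taken only at steady state would report the same value for both complexes, even though they differ markedly in how quickly they form.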

The realization that changes in amino 
acids far from the ligand-binding site can 
substantially alter drug response and recep- 
tor signaling challenges both the pharma- 
ceutical industry and academic researchers 
to rethink drug design strategies—to not just 
focus on hypotheses of chemical optimiza- 
tion derived from analyses of direct ligand- 
receptor interactions in the orthosteric bind- 
ing site. Instead, it is necessary to explore 
the rich landscape of a receptor’s molecu- 
lar surface and its internal communication 
network, as indicated by the analyses of 
Heydenreich et al. In the future, such studies 
can be further enhanced by integrating com- 
putational and experimental measurements 
of ligand-specific conformational dynamics 
and activation kinetics in GPCR-transducer 
complexes to fully decipher the molecu- 
lar basis of ligand efficacy and potency at 
GPCRs for the development of improved 
therapeutics. 

Mimicking polar bear hairs in aerogel fibers

Encapsulated aerogel fibers offer thermal insulation,
breathability, and strength


By Zhizhi Sheng and Xuetong Zhang


Aerogels have been considered as super
thermal insulators since their
invention in 1931. The successful
adoption of aerogels, which are
made by replacing the liquid in a gel
with gas while maintaining a stable
network, in NASA vehicles such as the 
Mars rover has inspired the use of aerogels 
in personal warming. But applications of 
aerogels in textiles have been limited by 
their insufficient moisture permeability. 
Aerogel fibers are being developed to ad- 
dress the trade-off between thermal insula- 
tion and moisture permeability. However, 
aerogel fibers presently lack the strength 
and flexibility necessary to allow weav- 
ing or knitting into a wearable fabric. On 
page 1379 of this issue, Wu et al. (1) report 
a feasible strategy to construct bioinspired 
knittable aerogel fibers with superb ther- 
mal insulation and mechanical robustness, 
which allows weaving into fabric with air 
permeability. This could instigate the de- 
velopment of advanced thermal textiles for 
personal warming. 

Aerogel fibers can be obtained by spin- 
ning nanoscale building blocks (e.g., ara- 
mid nanofibers, silica nanoparticles, gra- 
phene nanosheets, and so on) in a spinning 
dope with sol-gel transition and subse- 
quent special drying (2-7). By modulating 
the interactions between these building 
blocks, various aerogel fibers with desired 
mechanical properties can be achieved, 
making bending, knotting, twisting, and 
weaving possible (8). However, at the pres- 
ent time, aerogel fibers cannot be woven 
into large textiles because of their insuffi- 
cient strength; they also cannot withstand 
machine washing. Thus, they do not meet
the requirements for modern textile ap- 
plications. Wu et al. fabricated encapsu- 
lated aerogel fibers (EAFs) by mimicking 
the core-shell characteristic of polar bear 
hairs, which allows heat insulation in ex- 
tremely cold environments in the Arctic. 
The EAFs are composed of a chitosan aero- 
gel core that can trap air in the abundant 
air pockets (more than 90% porosity) and 
provide sufficient thermal insulation. The 
thermoplastic polyurethane (TPU) shell 
affords mechanical robustness (1000% 
tensile strain). The textile made from 
EAFs has a thermal conductivity of 26.9 
± 1.8 mW/m·K, outperforming wool, nylon,
and polyester textiles. The EAF textile
is only one-fifth as thick as down, which 
has a comparable thermal insulation performance.
At -20°C, the EAF sweater produced
by Wu et al. exhibited thermal protection
superior to that of a down jacket,
wool sweater, and cotton sweater.

Aerogel fibers are promising materials
for next-generation thermal insulation
textiles (8). For these fabrics, individual
aerogel fibers can provide the best thermal
insulation while the interfiber voids
allow breathability (see the figure). The
superiority of aerogel fibers in recent textile
history is reflected in the following two
key aspects: processing and performance.
The traditional sol-gel transition is a slow
and static process (8) that is difficult to
couple with the fast, dynamic, and continuous
spinning procedure used to make
traditional textiles. To solve this, Wu et al.
produced EAFs by freeze-spinning a precursor,
which then undergoes a dynamic
sol-gel transition followed by freeze-drying
and coating-drying steps. These fibers are
ready for use and do not require complicated
postprocessing steps such as conventional
yarn-spinning in textiles. As a result
of this processing method, the sweater
knitted with EAFs by Wu et al. costs
less for source material but has the same
warmth as down and also has excellent
performance, including washability, dyeability,
and flexibility. Importantly, the
EAFs can also retain stable thermal
insulation after more than 10,000 stretching
and releasing cycles.

Next-generation textiles with enhanced thermal insulation
Polar bear hairs are composed of a porous core surrounded by a dense shell, which provides heat insulation
in extremely cold conditions. Mimicking the structure of these hairs by encapsulating an aerogel core in a
thermoplastic polyurethane (TPU) shell confers mechanical robustness and superior thermal insulation. These
encapsulated aerogel fibers (EAFs) can be woven, dyed, stretched, and washed, making them ideal for use in
textiles; EAFs could also have other applications beyond personal warming.

Compared with electrospun fibers and
other traditional porous fibers, aerogel fibers
have ultrahigh specific surface area
and superhigh porosity. Thus, in addition 
to their use in textiles, aerogel fibers may 
be advantageous in adsorption, separa- 
tion, optics, energy, and other applications 
(4, 5, 9-11). The existence of mesopores in 
aerogel fibers, together with the macro- 
pores formed by overlapping fibers, makes 
it possible to achieve both high sorption 
capacity and fast sorption kinetics. These 
abundant multiscale porosities are also 
helpful in solving the conflict between 
permeability and selectivity in membrane 
separation. Furthermore, the transparency 
of aerogel fibers can be tuned by rational- 


izing the size of nanoscale building blocks 
(e.g., silica nanoparticle size) for optical 
devices (3). Notably, the richness of nano- 
pores in aerogel fibers could allow entrap- 
ping guest materials in these pore spaces, 
for instance, molecules, nanoparticles, and 
functional liquids stabilized by capillary 
forces (2, 4, 12-14), which will be useful 
for a myriad of applications. For example, 
the introduction of phase-change fluids 
in aerogel fibers achieves both thermal 
energy storage and photothermal or elec- 
trothermal responsiveness (5). Moreover, 
adjusting the interior nanopockets of aero- 
gel fibers could yield materials with appli- 
cations in environmental remediation and 
biomedical science, such as carbon dioxide 
capture (13) and efficient hemostasis (15).

Although aerogel fibers have witnessed 
tremendous progress, challenges exist re- 
garding how to develop fast spinning tech- 
nology and resolve the continuous fabrica- 
tion that is necessary for mass production. 
It is crucial to understand spinning ther- 
modynamics and kinetics to decouple the 
mismatch between the slow sol-gel tran- 
sition rate and the fast spinning rate. At 
present, the wet spinning rate of aerogel 
fibers is around 20 m/min, which is much 
slower than the industrially available wet 
spinning (200 m/min), dry spinning (1500 
m/min), and melt spinning (5000 m/min) 
of traditional fibers. Additionally, although 
continuous spinning and batch drying have 
been established for aerogel fibers [by Wu 
et al. and also see (2, 3)], a feasible strat- 
egy to bridge the two processes to realize 
a fully continuous process remains elusive. 
To achieve whole continuous production of 
aerogel fibers, unraveling the mechanism 
of solvent removal during a dynamic rapid 
decompression is critical, accompanied by 
instrument capability. With the simultaneous
advancement of materials and fabrication,
aerogel fibers could have many potential
applications.
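
The production gap described above is easy to quantify from the spinning rates quoted in the text. The snippet below is a small worked calculation; the variable and function names are ours, and the rates are the approximate figures given in the paragraph.

```python
# Spinning rates in m/min, as quoted in the text (approximate figures).
AEROGEL_WET_SPINNING = 20
INDUSTRIAL_RATES = {
    "wet spinning": 200,
    "dry spinning": 1500,
    "melt spinning": 5000,
}

def speed_gap(industrial, aerogel=AEROGEL_WET_SPINNING):
    """How many times faster each industrial process is than aerogel wet spinning."""
    return {name: rate / aerogel for name, rate in industrial.items()}

gaps = speed_gap(INDUSTRIAL_RATES)
for name, factor in gaps.items():
    print(f"{name}: {factor:.0f}x faster than aerogel fiber wet spinning")
```

Even against conventional wet spinning, the closest comparison, the aerogel process is an order of magnitude slower.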

Neurosurgery elucidates somatic mutations


Surgical innovation is helping to identify roles for somatic mutations in brain disorders 


By Eduardo A. Maury, Christopher A.
Walsh, Kristopher T. Kahle


Somatic variants are genomic alterations
present in some cells of the
body but not in others. In contrast
to inherited germline variants, somatic
variants are typically acquired
during postzygotic cell divisions or
from environmental factors. Somatic variants
in a given tissue can also be highly
dynamic, such as in the brain, where the
number of clonal variants can change during
aging and disease (1). Neuro-oncology
has greatly benefited from the analyses of 
somatic mutations in tissue obtained from 
surgical biopsies or resections. Increased 
understanding of somatic mutations in 
brain tumors brought a revolution in prog- 
nostication and individualized treatments. 
Somatic mutations are now increasingly 
being implicated in neurological disorders 
beyond cancer, particularly in children 
with congenital brain and cerebrovascu- 
lar diseases. Teams of neurosurgeons and 
scientists are performing gene sequencing 
on surgical brain samples from these pa- 
tients—paving the way to expand under- 
standing of nononcologic brain disease 
through the study of somatic mutations. 
To date, most studies of somatic muta- 
tions in nononcologic neurological disease 
have relied on post-mortem brain samples 
or samples from other tissues, such as the 
blood. This is partly owing to the limited 
number of medically indicated approaches 
for obtaining brain tissue while the patient 
is alive and current clinical practice that 
frequently discards tissue during surgery 
because it is thought to be of limited diag- 
nostic value. This lack of fresh brain tissue 
limits the sensitivity and specificity of so- 
matic mutation detection because DNA and 
RNA quality can degrade post-mortem, and 
variants specific to brain cells might not 
be present in nonbrain tissue. In addition,
blood, which is the most readily available 
tissue for sampling, can accumulate tissue- 
specific somatic mutations such as variants 
acquired through clonal hematopoiesis, 
which complicates the detection of somatic 
variants present in the nervous system. 

Several recent studies have demon- 
strated the power of exploring somatic 
mutations directly from tissue collected in 
the operating room to gain insights into 
neurological disorders. These disorders 
include congenital focal cortical dysplastic 
syndromes such as focal cortical dysplasia 
(FCD) and hemimegalencephaly (HMG) 
(2-4). Clinically, patients with these con- 
ditions present with seizures refractory to 
pharmacological management and are di- 
agnosed by use of brain imaging to detect 
lesions that disrupt the cortical layering of 
the brain. Lesion size can vary from sub- 
centimeter in FCD to a whole hemisphere 
in HMG. The current treatment is radical 
surgical excision of the lesions, which puts 
the patient at high risk of morbidity, espe- 
cially in the case of hemispherectomy to 
treat HMG. 

Genetic causes of these conditions were 
initially difficult to identify because they 
only affected a subset of cells. Blood sam- 
ples obtained from patients with FCD and 
HMG appear largely normal when analyzed 
with traditional sequencing approaches 
that capture germline variants. However, 
the development of next-generation se- 
quencing technologies and bioinformatic 
pipelines has made it possible to identify 
somatic variants from neurosurgically re- 
sected samples. Using this approach, so- 
matic mutations in the mechanistic target 
of rapamycin (MTOR) gene that increase 
mTOR pathway activity were identified as 
causative of these disorders. The mTOR 
pathway helps determine cell size and has 
been implicated in cancer. Larger lesions 
had a larger proportion of cells affected by 
the mutations (4, 5). These genetic insights 
were brought about through collaborations 
between neurosurgeons and scientists and 
led to a new clinical classification system 
for FCD (6). 

Inhibitors of the mTOR pathway now 
represent promising therapeutic ap- 
proaches for patients with FCD and HMG. 
Small-molecule inhibitors used to treat 
cancer could in theory be used as monotherapies
or adjuncts to surgical resection
for the treatment of FCD or HMG.
These inhibitors include sirolimus and
everolimus, which target mTOR directly,
or duvelisib and miransertib (ARQ-092),
which target proteins that interact with
mTOR. Encouragingly, treating patients
with tuberous sclerosis and phosphatase
and tensin homolog (PTEN) hamartoma
tumor syndrome (both neurodevelopmental
disorders) with everolimus resulted in
improved clinical outcomes in randomized
clinical trials (7, 8). Phase 2 clinical
trials on the effect of everolimus on FCD
are currently underway in the United 
States (NCT02451696) and South Korea 
(NCT03198949). However, questions re- 
main on the optimal inclusion criteria or 
treatment combinations for these studies. 

More recently, somatic mutations have 
been implicated in mesial temporal lobe 
epilepsy, which is one of the most common 
forms of epilepsy and is often resistant to 
treatment with antiseizure medication. 
Whole-exome sequencing of 105 hippocam- 
pal samples from patients with this type of 
epilepsy who underwent surgical treatment 
revealed pathogenic somatic variants in 
protein tyrosine phosphatase nonreceptor 
type 11 (PTPN11), son of sevenless homolog 
1 (SOS1), KRAS, BRAF, and neurofibromin
1 (NF1). These genes are all known or predicted
to activate the mitogen-activated
protein kinase (MAPK) signaling pathway, 
which is involved in cancer (9). Therefore, 
established anticancer agents targeting the 
MAPK pathway could be imagined as neu- 
rosurgical adjuncts or even a primary treat- 
ment option for this disorder. 

Recent studies of somatic mutations 
in sporadic arteriovenous malformations 
(AVMs) revealed activating mutations in 
KRAS that can be therapeutically targeted 
(10). AVMs are vascular lesions present in 
the brain that cause hemorrhagic strokes 
by forming abnormal connections between 
arteries and veins, forming a nidus (tan- 
gle) of fragile blood vessels. Currently, sur- 
gery is the only treatment for this condi- 
tion, but surgery is highly challenging, and 
complications such as massive bleeding 
occur frequently. Consequently, interven- 
tions carry high risk of morbidity for the 
patient and require specialized neurosur- 
gical care, which is not readily available 
worldwide. In one study, therapeutically
resected samples from 26 patients with
AVMs were obtained, and whole-exome sequencing
was used to identify somatic mutations
present in at least 1% of cells. Using
cell-sorting and digital droplet polymerase
chain reaction-based sequencing, the
study showed that pathogenic somatic mutations
in KRAS were enriched in brain endothelial
cells. Furthermore, application of
MAPK kinase (MEK) and phosphatidylinositol
3-kinase (PI3K) inhibitors to cultured
endothelial cells down-regulated the KRAS
signaling pathway, demonstrating potential
therapeutic efficacy for AVMs.

Somatic variation-informed neurosurgery and medical management
In the future, somatic mutation information could be used to direct neurosurgical and medical management of neurological disorders. Biopsies or surgical resection
samples could be obtained and processed through next-generation sequencing to identify somatic variants. These variants could then be correlated with clinical
data such as demographics, neurophysiology, neuroimaging, and medical history to expand research efforts and identify targeted therapies that could be used as
monotherapy or adjuncts to surgery. This information could also be used for surgical planning if necessary.
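
To see why detecting a mutation present in only about 1% of cells demands deep sequencing, consider a simple read-sampling sketch. It assumes a heterozygous variant in diploid cells, error-free reads, and a binomial model; the function names, the three-read calling threshold, and the depths are hypothetical and do not describe the pipeline used in the study.

```python
from math import comb

def expected_vaf(cell_fraction, ploidy=2, mutant_copies=1):
    """Expected variant allele fraction for a mutation in a fraction of cells."""
    return cell_fraction * mutant_copies / ploidy

def detection_probability(vaf, depth, min_alt_reads):
    """P(at least min_alt_reads variant reads) under binomial read sampling."""
    p_below = sum(
        comb(depth, k) * vaf**k * (1 - vaf) ** (depth - k)
        for k in range(min_alt_reads)
    )
    return 1.0 - p_below

vaf = expected_vaf(0.01)  # mutation in 1% of cells gives a VAF of about 0.5%
for depth in (100, 1000):
    p = detection_probability(vaf, depth, min_alt_reads=3)
    print(f"depth {depth}: P(at least 3 variant reads) = {p:.3f}")
```

Under these assumptions, a variant that would almost always be missed at 100x coverage becomes likely to be seen at 1000x, which is consistent with the emphasis on high-depth sequencing of resected tissue.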

The full resection of pathological tissues 
can be difficult or unsafe, particularly in 
nononcologic disorders such as multifo- 
cal epilepsy and neurovascular disease. 
Therefore, to expand the study of somatic 
variants in neurological disease, innovative 
tissue sampling approaches are required. 
To overcome this challenge, neurosurgeons 
have started to create methods that permit 
tissue sampling during common proce- 
dures. These sampling approaches could 
then also be used to inform the pharma- 
cological and surgical approaches that are 
used to treat the disease before attempting 
a full resection. 

The standard diagnosis of a patient with 
treatment-resistant epilepsy involves the 
implantation of stereo-electroencephalog- 
raphy electrodes, which allow intracranial 
recordings of electrical activity, to identify 
the source of epileptogenic activity in the 
brain. In one study, DNA was obtained 
from the electrodes after they had been 
removed from the brain. Analysis of this 
DNA identified pathogenic somatic variants
in the FCD-related genes AKT3 and
DEP domain containing 5, GATOR1 subcomplex
subunit (DEPDC5). The brain regions
carrying the highest fraction of cells
with somatic mutations in these genes cor- 
responded to the area of largest epilepto- 
genic activity across individuals (11). This 
study highlights the great clinical potential 
of using electrodes to identify correlations 
between somatic genomic alterations and 
functional brain disruption. The approach 
could be extended to study how somatic 
variants and epileptic activity disrupt tran- 
scription: Highly epileptogenic regions 
could be compared with those with normal 
neuronal activity. More specific analyses 
for detecting somatic variants or designs 
of electrodes that optimize tissue sampling 
might provide higher-definition genomic 
maps of the brain. 

Another promising approach is endovascular
sampling of blood vessel lumens (12),
although its application to somatic variant 
detection is yet to be explored. This ap- 
proach makes use of diagnostic cerebral 
angiography, which involves the insertion 
of a catheter into the patient’s vasculature 
and injection of dye to visualize blood ves- 
sel morphology. This procedure is com- 
monly used in the diagnosis and staging 
of vascular pathologies from strokes to 
vascular malformations. A guidewire can 
accurately target a specific region of the 
lumen for sampling. In a study of patients 
with AVM, the gene expression profile of 
the lumen samples not only captured dys- 
regulation in the expected MAPK pathway 
but also correlated well with the expres- 
sion profile of the resected AVMs from 
these patients (12). This is a notable finding
because this approach could allow for
surgical planning based on the molecular 
profile of the vascular pathology and could 
be applied to any pathology for which 
diagnostic cerebral angiography is used.
However, whether it could yield enough
DNA for the detection of somatic variants 
remains an open question. 

In addition to congenital or neurode- 
velopmental disorders, somatic mutations 
are being studied in neurodegeneration. 
A study in Alzheimer’s disease performed 
single-cell whole-genome sequencing on 
neurons from post-mortem brain samples 
and found an increased rate of somatic 
mutations in neurons from individuals 
with Alzheimer’s disease compared with 
controls. This increase was partly driven 
by higher amounts of oxidative damage to 
nucleotides (13). Some existing neurosurgical
approaches are suited to sample tissue
from individuals with neurodegenerative 
disorders—for example, using the tissue 
along the path of deep-brain stimulation 
electrodes in Parkinson’s disease, or biop- 
sies of the insertion site when implant- 
ing a ventricular shunt in a patient with 
normal-pressure hydrocephalus. However, 
these approaches have not yet been used 
for the detection of somatic mutations. 
Deep-brain stimulation is beginning to be 
used for the treatment of neuropsychiatric 
disorders, and the ability to sample brain 
tissue directly from this patient population 
represents an exciting new frontier in so- 
matic mutation research. 

To assess the somatic mutation landscape 
more fully across neurological disorders, it 
is vital to leverage consortia and biobanks 
to gather large numbers of surgically re- 
sected brain samples. Neurosurgeons and 
scientists from the Cleveland Clinic in the 
United States and the European Brain Bank 
Consortium were recently able to gather 
474 neurosurgically resected brain sam- 
ples from individuals with various forms 
of drug-resistant epilepsy. They performed 
high-depth whole-exome sequencing to
identify putative genes associated with 
epilepsy and to compare somatic mutation 
rates across disorders (14). The use of consortia
could also help to obtain control tissue
that is typically discarded, for example,
intact blood vessels from resected lobecto- 
mies or nondiseased tissue from wide resec- 
tions. Further efforts of consortia to bring 
neurosurgeons and scientists together have 
the potential to enhance understanding of 
these highly complex disorders (15).

The basic and translational research of 
somatic mutations in neurological disor- 
ders will require close collaborations across 
disciplines. From molecular biologists to 
bioinformaticians and clinicians, it is es- 
sential that there is an appreciation of the 
intricacies of the current treatment of these 
disorders. However, as the limits of what 
can be learned from post-mortem tissues 
are reached, it is critical that neurosurgeons 
lead innovation in tissue sampling and data 
analysis to maximize not only how much 
can be learned from patients but also how 
these findings can be effectively translated 
into better outcomes (see the figure). More 
imperative, training specialized neurosur- 
geon-scientists able to wield both classical 
neurosurgical micro-instruments as well as 
new tools, including “molecular scalpels” 
(pharmacotherapy or gene therapies) and 
the computational algorithms required to 
analyze these samples, will further catalyze 
the translation of somatic mutation research
into benefits for patients.

Increasing policy support for reducing racial health disparities

Perceiving racial health disparities as unjust could catalyze or halt change


By Allison Earl and Veronica Derricks


In the United States, Black Americans
experience disproportionately negative
outcomes in domains as varied as wealth,
employment, and health (1). However, efforts
to obtain support for the policies
necessary to tackle these disparities have
had limited success, underscoring a critical
need for effective interventions to change behavior.
On page 1394 of this issue, Brown et
al. (2) report that highlighting racial health
disparities was more likely to prompt social
media engagement and policy support compared
with highlighting disparities in economic
measures or belongingness (the feeling
of being an accepted member of a group).
The authors demonstrated that health disparities
are catalyzing because health inequities
contravene moral values that should
never be violated and, consequently, evoke
feelings of injustice. The premise is intriguing
and could be used to increase policy support
for reducing racial gaps.

Information about racial health disparities results in more social media
engagement than information about other types of racial disparities.


Myriad factors contribute to racial disparities
in health outcomes at both the interpersonal
and structural levels. Interpersonally,
clinicians often hold racial biases that influ- 
ence clinical decision-making (e.g., chemo- 
therapy or screening recommendations) (3) 
and behaviors (e.g., attempts at relationship 
building) (4). Structurally, policies can nega- 
tively affect Black Americans’ access to and 
quality of health care (e.g., unequal allocation 
of resources, such as vaccines, and limited 
clinic locations and hours) (5). Policies that 
are ostensibly unrelated to health care also 
have widespread effects on 
health outcomes. For in- 
stance, economic and gov- 
ernmental institutions that 
restrict wealth for racially 
minoritized populations 
(e.g., charging higher inter- 
est rates when purchasing 
a home) can shape Black 
Americans’ access to health 
care and transportation,
insurance costs, and finan- 
cial resources available for 
medical treatments (6). 

Despite the persistence 
of health inequities, politi- 
cians, policy-makers, and 
the general public in the 
US have shown little inter- 
est in taking action to ef- 
fectively mitigate these dis- 
parities and may actively 
work to maintain these 
gaps (e.g., continued efforts to repeal the 
Affordable Care Act and restrict Medicaid 
funding) (7). In their work, Brown et al. randomly
assigned Americans to view informa-
tion (messages and infographics) about ra- 
cial disparities in health, economic factors, 
or belongingness in either experimental (on- 
line) or social media contexts. They found 
that information about health disparities in- 
creased perceived injustice, which enhanced 
social media engagement and support for 
policies to reduce disparities. 

Given these findings, one temptation may 
be to frame all racial disparities in the con- 
text of their impact on health. Yet, decades of 
research, billions of dollars in funding, and 
the National Institute on Minority Health
and Health Disparities have been dedicated 
to reducing disparities in health outcomes. 
Although many outcomes have improved 
(e.g., disparities in cancer rates) (3), many 
disparities remain. 

Brown et al.’s findings suggest that perceiving
injustice is key for inspiring support for
policies designed to ameliorate disparities. 
People are motivated to believe that they live 
in a just society, where individuals get what 
they deserve, and that society has created in- 
stitutions that afford everyone fundamental 
human rights (8). However, perceiving in- 
justice may not always prompt remediation; 
people may manage perceptions of injustice 
by deepening their commitment to systems 
that create and reinforce disparities at the 
expense of minoritized groups (8). For in- 
stance, a sample of mostly white Americans 
reported less support for economic- and 
health-related policies to manage COVID-19 
when the policies explicitly mentioned that 
Black individuals would benefit (9). 

People are also sensitive to social group 
membership and prefer to allocate resources 
to “people like me” at the expense of “people 
not like me” (10). For example, when faced
with the notion that their racial group may 
lose social status, white Americans were 
more likely to endorse policies that favored 
their own group at the expense of other racial 
groups (11). White Americans may also dehu- 
manize Black Americans so that the inequi- 
ties are no longer perceived as unjust (12). To 
have the most impact, work aimed at implementing
the findings from Brown et al. may
need to consider that people are motivated 
to maintain the status quo and may react 
against perceived injustice by doubling down 
on the inequities instead of reducing them. 

Brown et al. examined how people re- 
spond once confronted with injustice; how- 
ever, people might be content to simply 
avoid confronting injustice altogether. For 
instance, white Americans would rather 
avoid racialized information than acknowl- 
edge racial advantages and disadvantages 
(13). Indeed, people prefer to ignore, dero- 
gate, or counterargue messages that chal- 
lenge their current attitudes and behaviors 
rather than be persuaded by them (14).
Unless someone is already engaging in 
egalitarian actions or has positive attitudes 
toward reducing racial gaps in outcomes, 
they may be more likely to ignore any
message about racial disparities than to act
to remedy them.

It is also important to consider the perspec- 
tive of audiences who are directly implicated 
by messages describing disparities. For ex- 
ample, receiving information that highlights 
racial disparities in HIV rates reduces Black 
Americans’ attention to the information and 
lowers their trust in the information provider
because such information raises concerns 
about being judged by one’s racial identity 
(15). Thus, if interventions do not carefully
consider the experiences that audiences are 
bringing with them, they may evoke disen- 
gagement and ironically exacerbate the prob- 
lems that they are attempting to remedy. 

Racial disparities in the US persist across 
many important domains. Brown et al. have 
taken up the call to address these dispari- 
ties by designing and implementing a series 
of thought-provoking studies that highlight 
perceived injustice as a lever to motivate ac- 
tion on racial gaps. To foster policy support 
most successfully, researchers must take into 
account how audiences may respond to perceiving
injustice. To build on Brown et al.’s
galvanizing work, scholars could develop 
complementary interventions that address 
barriers that might arise when encountering 
messages highlighting injustice (e.g., people 
not wanting to engage with the messages or 
having little empathy for groups implicated 
by these disparities). This will allow the diffi- 
cult but incredibly important work of reduc- 
ing racial gaps to continue. 

An unexpected role for a glutamate receptor

γ-Aminobutyric acid acts on a glutamate receptor, evoking synaptic plasticity


By Ian D. Coombs and Mark Farrant


Ion channels activated by the neurotransmitter
glutamate underlie excitatory signaling
between neurons in the brain. In
mammals, these ionotropic glutamate
receptors (iGluRs) belong to three families
formed from GluA, GluN, and GluK
subunits, respectively: the α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid
(AMPA), N-methyl-D-aspartate (NMDA), and
kainate receptors (1). By contrast, receptors
of a fourth homologous iGluR family, the δ
or GluD receptors, do not respond to glutamate.
Despite sharing a similar architecture
with other iGluRs, including a transmem- 
brane pore, the question of whether GluD 
receptors pass current has been controversial 
(2, 3). Instead, they are best known as synap- 
tic organizing proteins (2, 4). On page 1389 of 
this issue, Piot et al. (5) show that one GluD 
family member, GluD1, can bind the inhibitory
neurotransmitter γ-aminobutyric acid
(GABA) and trigger potentiation of GABA- 
mediated synaptic currents. This challenges 
the dogmatic distinction between glutamate 
and GABA receptors and identifies GluD1 as 
a regulator of inhibitory signaling. 

The GluD family members, GluD1 and 
GluD2, can both be found at excitatory syn- 
apses. They connect, through adaptor pro- 
teins of the cerebellin family, to presynaptic 
adhesion molecules of the neurexin family, 
forming trans-synaptic complexes that play 
key roles in synapse assembly and activity- 
driven synaptic modifications (6). These roles 
were first identified for GluD2 at synapses 
made by granule cell parallel fibers onto 
Purkinje cells in the developing cerebellum, 
where the binding of D-serine, released by
Bergmann glia, enables long-term depression
of excitatory transmission by facilitating
AMPA receptor internalization (7). GluD1 is
similarly implicated in trans-synaptic inter- 
actions that have been shown to influence 
the formation of excitatory synapses (8) and 
the balance of AMPA receptor- and NMDA 
receptor-mediated signaling (9). However, 
distinct among iGluRs, GluD1 has also been 
found at GABA-releasing inhibitory synapses, 
specifically, at those between somatostatin- 
positive interneurons and cortical pyramidal 
neurons. GluD1 interactions with cerebellin 
and neurexin, together with the binding of 
D-serine or glycine, are proposed to trigger
intracellular signals that regulate the assembly
of the inhibitory postsynapse (10).

Prompted by these findings, Piot et al. 
investigated the action of GABA on GluD1 
receptors. Using recombinant receptors ex- 
pressed in African clawed frog (Xenopus 
laevis) oocytes, they recorded currents 
from constitutively open GluD1 and GluD2
“Lurcher” mutants (11). The authors found
that the GluD1 currents were enhanced by
both D-serine and GABA, which competed
for the same binding site. GABA was as efficacious
as D-serine, albeit with much lower
potency. By contrast, GluD2 currents were inhibited
by D-serine but unaffected by GABA,
hinting at multiple functional differences 
between GluD1 and GluD2, at least in their 
Lurcher forms. Piot et al. visualized GABA 
binding to GluD1 using x-ray crystallography 
of isolated GluD1 ligand-binding domains, 
which established that key molecular deter- 
minants of the binding are shared by GABA 
and D-serine, consistent with the competitive
actions observed. Of critical importance for 
their subsequent experiments, the authors 
also identified point mutations in GluD1 that 
were able to abolish GABA binding while 
leaving D-serine binding largely intact. 

To address the question of whether 
GABA could signal through wild-type (non- 
Lurcher) GluD1 receptors at synapses, Piot 
et al. recorded from CA1 pyramidal neurons 
in acute hippocampal slices from mice and 
electrically stimulated the release of GABA 
from neurons in the stratum lacunosum-moleculare
(where the dendrites of CA1
pyramidal neurons are located), which is a 
region of intense GluD1 expression. Bursts 
of high-frequency synaptic stimulation, or 
application of D-serine, enhanced the amplitude
of inhibitory postsynaptic currents
(IPSCs) mediated by type A GABA (GABAa) 
receptors (see the figure). These effects were 
occluded by short hairpin RNA-based down- 
regulation of GluD1 expression. Moreover, 
by introducing GluD1 mutants that lacked 
specific signaling capabilities, the authors 
showed that GABA binding and cerebellin 
interactions were required for the enhance- 
ment of IPSC amplitude but the passage of 
ions through GluD1 channels was not. 

It remains to be determined which of the
four isoforms of cerebellin (4) participate in 
the inhibitory plasticity described by Piot et 
al. and whether this “non-ionotropic” effect 
of GluD1 requires cerebellin to be engaged 
with presynaptic neurexins. This is a key 
question. GABAergic interneurons are fa- 
mously heterogeneous, and such interactions 
might be expected to determine the cellular 
specificity of the plasticity. Multiple distinct 
interneuron subtypes are found in, or bor- 
dering, the stratum lacunosum-moleculare, 
prominent among which are neurogliaform 
cells (12). These form atypical synapses and, 
unlike most other interneurons, their activa- 
tion results in prolonged GABA elevations 
that reach relatively low peak concentrations. 
Given this, the apparent low affinity of GluD1 
for GABA is particularly intriguing. 

How has the iGluR gating machinery been 
adapted by GluD1 to produce an apparently 
non-ionotropic effect? Of note, calcium ions 
greatly decrease the potency of D-serine at 
GluD2-Lurcher receptors by stabilizing the 
receptor ligand-binding domains in a dimeric 
conformation (13), likely dictating whether 
ligand binding ultimately engages gating- or
desensitization-like states. The GluD1 ligand- 
binding domain structures obtained by Piot 
et al. contain calcium ions bound at the di- 
mer interface; whether the functional influ- 
ence of these ions is similar to that seen with 
GluD2, and how this affects the synaptic ac- 
tion of GABA on GluD1, are important issues. 
Some of these questions might be addressed 
by using cryo-electron microscopy to visu- 
alize intact GluD1 receptors in the presence 
and absence of GABA and calcium ions. 

Beyond the immediate mechanics of 
GluD1 “activation,” it is not known which
downstream effector proteins and signaling 
pathways are ultimately key to IPSC poten- 
tiation nor how this potentiation interacts 
with other forms of inhibitory plasticity. 
The low affinity of GABA, and, therefore, the 
presumed brevity of the GluD1 signal (even 
during high-frequency synaptic stimulation), 
suggests a tight linkage between GluD1 and 
its effector. An unbiased proteomic screen 
previously identified several potential GluD1 
interacting proteins (10), but their roles in
the newly described GABA-induced plasticity 
remain to be determined. It is interesting to 
note the recent recognition that alternative 
splicing can give rise to GluD1 isoforms with 
different carboxy-terminal cytoplasmic tails 
(14). This raises the possibility of different co- 
horts of GluD1 binding partners and thus the 
involvement of different signaling pathways 
that are potentially dependent on the type of 
synapse or developmental stage. 

In humans, numerous variants of the gene 
encoding GluD1 (GRID1) have been identified.
These copy number and missense variants
are associated with several neurological
conditions, including schizophrenia, autism,
intellectual disability, and seizures (15).
Whether the disruption of GluD1-dependent
inhibitory plasticity plays any role in the effects
of disease-associated GRID1 variation is
an important question for future study. 

RETROSPECTIVE


Endel Tulving (1927-2023) 


Explorer, innovator, and theorist of human memory 


By Roberto Cabeza and Emrah Düzel


Endel Tulving, leader in the field of human
memory research, died on 11 September.
He was 96. Tulving pioneered
the study of memory encoding and
retrieval conditions and first distinguished
episodic from semantic memory.
His work with amnesic patients yielded
insights into implicit memory, multiple mem- 
ory systems, and the relationship between 
memory and consciousness. Tulving also em- 
braced emerging functional neuroimaging 
techniques, elucidating the roles of frontal 
and parietal lobes in episodic memory. 

Born on 26 May 1927 in Petseri, Estonia, 
Tulving moved to Germany at age 19 and to 
Canada in 1949. He earned his bachelor’s 
and master’s degrees in psychology from the 
University of Toronto in 1953 and 1954, re- 
spectively. After completing his PhD in exper- 
imental psychology at Harvard University in 
1956, he joined the University of Toronto fac- 
ulty. Except for a few years at Yale University, 
he remained there until he retired in 1992. 

Joining the cognitive revolution, Tulving 
challenged the idea of memory as simple 
stimulus-response associations. In 1966, he 
showed that learners impose subjective or- 
ganization onto new information. In 1973, he 
revealed the “encoding specificity principle”: 
Retrieval cues’ capacity to elicit memories de- 
pends on matching encoding conditions. 

Distinguishing semantic memory (general 
knowledge of the world) and episodic mem- 
ory (personally experienced unique events) 
was Tulving’s most far-reaching contribution. 
When he proposed the idea in 1972, he stirred 
intense debate by upending the prevailing 
view that long-term memory was unitary. 
The episodic-semantic distinction has since 
influenced the fields of cognition, neuropsy- 
chology, cognitive neuroscience, and clinical 
research on, for instance, Alzheimer’s disease 
and frontotemporal dementias. 

In the mid-1970s, Tulving began conduct- 
ing neuropsychological research on patients 
with brain injuries. Alongside his graduate 
student Daniel Schacter, he investigated the 
distinction between explicit (conscious) and
implicit (unconscious) memory, finding in 
1982 that recognition memory, like other ex- 
plicit memory types, declines over time, but 
priming effects do not. This revelation and 
other evidence led him to propose a multiple 
memory systems theory including episodic, 
semantic, and implicit memory systems. 

In 1985, Tulving associated memory forms 
with types of conscious awareness: proce- 
dural memory (skill learning) is “anoetic” 
(non-knowing), semantic memory is “noetic” 
(knowing), and episodic memory is “autono- 
etic” (self-knowing) awareness. He coined 
the term “mental time travel” to convey that 
autonoetic awareness is akin to reliving a 
past experience. These links to consciousness 
allowed Tulving to address memory tests’
inability to isolate one type of memory. He
introduced the remember/know paradigm, 
in which participants use subjective experi- 
ence to distinguish between two components 
of recognition memory: recollection, which 
truly reflects episodic memory, and familiar- 
ity, which may reflect contributions of other 
memory systems. The remember/know para- 
digm has been widely used since, including in 
patient and functional neuroimaging studies. 

After retirement, Tulving moved to the 
Rotman Research Institute of the Baycrest 
Centre in Toronto and became a visiting pro- 
fessor at the Center for Neuroscience and 
Department of Psychology at the University 
of California, Davis. One of the first memory 
researchers to adopt brain imaging technol- 
ogy, Tulving was intrigued by the possibility 
of determining whether episodic versus semantic
memory, and autonoetic versus noetic
awareness, are neurobiologically dissociable. 
In 1994, he used positron emission tomogra- 
phy (PET) studies to show the involvement of 
frontal and parietal regions in episodic mem- 
ory. He also noticed that early PET studies 
revealed an intriguing regularity: Episodic 
encoding elicited activations mainly on the 
left prefrontal cortex, but retrieval engaged 
primarily right prefrontal regions. The right- 
lateralization of retrieval activity was linked 
to the mental state of episodic retrieval 
Tulving called “episodic retrieval mode,” 
which was associated with sustained activity. 
Tulving’s novelty-encoding hypothesis high- 
lighted the neurobiological and motivational 
importance of novel events for forming new 
episodic memories. In the late 1990s, he re- 
alized that episodic memory allows mental 
time travel to one’s future as well as past. 

We and our collaborator Lars Nyberg were 
among Tulving’s first postdoctoral students 
at Rotman. At weekly meetings, when post- 
docs presented ideas, Tulving’s input was 
witty and sharp but always friendly. He cel- 
ebrated our successes. During his frequent 
visits to the postdoc office, he would ask us 
about our latest discoveries or what we had 
learned from Mother Nature before telling us 
a dad joke. His humor also infused his sci- 
entific communication. During one lecture at 
an international conference, he had a student 
cross the stage dressed in a gorilla costume to 
make the point that novelty is key to making 
episodic memories endure. Indeed, we can 
still vividly recall this event 25 years later. 

With his ability to communicate across 
disciplines and his openness to unfamiliar 
research fields, Tulving welcomed us into 
cross-disciplinary endeavors in episodic
memory in nonhuman animals, clinical neu- 
rology, and psychiatry. Although at times he 
expressed annoyance when a grant or paper 
was rejected, he would immediately rally and 
try again. He and his wife, accomplished art- 
ist Ruth Mikkelsaar, enjoyed hosting us in 
their home, where we drank Manhattans and 
discussed wide-ranging topics and ideas. 

Tulving’s numerous honors and awards 
include memberships in multiple national 
academies and societies. He also received 
distinguished contribution awards from the 
American Psychological Association and the 
Association for Psychological Science and 
was made an Officer of the Order of Canada, 
the country’s most prestigious civilian honor. 

In addition to his pioneering contribu- 
tions to the science of memory, Tulving will 
be remembered through the students whom 
he trained and guided. They continue to dis- 
seminate his ideas and to train their own 
students in the fundamental principles of 
memory function that he developed.

Data geopolitics


A group of scholars explore the implications of how 
governments control and restrict digital information 


By Dov Greenbaum 


In an era of escalating global polarization,
when jurisdictions enforce data
sovereignty, imposing governance or 
control over the data produced within 
their borders, “the news you see, the 
facts you see, and even the maps you see 
change depending on where you are,” ob- 
serves legal scholar Mark Lemley 
(1). And with the expanding role 
of artificial intelligence in soci- 
ety, constraints on access to the 
data on which such technologies 
are trained can have severe and 
systemic ramifications. 

In their timely tome Data Sovereignty, editors Anupam
Chander and Haochen Sun aim to provide a comprehensive
understanding of this issue. The book is divided into four
parts, covering digital sovereignty basics, the intersection
of technologies and institutions with data sovereignty,
trade regulations related to data flows, and
issues specific to data localization.

Data Sovereignty: From the Digital Silk Road to the Return of the State
Anupam Chander and Haochen Sun, Eds.

A central challenge concerning the analysis
of data sovereignty is the absence of a
precise definition for the term. The editors
assert that data sovereignty encompasses
the data protection laws, competition laws,
and security laws designed to control aspects
of data both within and beyond a territory.
They describe this concept, deemed
both “necessary and dangerous,” as the “exercise
of control over the internet” serving
as a protective barrier against both foreign
states and corporations.

Compared with its relatively
positive reception in the Global
North, data sovereignty is often
perceived in the Global South as
“the government hijacking the
Internet to protect itself,” write
Chander and Sun. In acknowledging
that data sovereignty is both a
necessity for democratic government
and a potential tool to “immunize
oppression,” they contend
that digital sovereignty ought to
be operated within a framework
of checks and balances.
Governments are often moti- 
vated to invoke data sovereignty 
by at least three considerations: a desire 
to protect citizens from illegal content and 
from having their content illegally usurped, 
a desire to promote their jurisdiction’s own 
technology, and a desire to control popula- 
tions by limiting access to incoming and out- 
going data streams. Throughout the book, 
many of the chapters look to the four most 
prominent stakeholders in the area of data 
sovereignty: China, the European Union, the 
United States, and global industry leaders. 


Data inflow and outflow are regulated differently by 
different jurisdictions, affecting what we see online.


Colloquially known as the “Great Firewall 
of China,” Chinese infrastructure and regu- 
lation showcase extensive state control over 
the inflow and outflow of data. Additionally, 
exported Chinese internet infrastructure 
seeks to project this sovereignty worldwide. 
The European Union employs regulations 
to principally control foreign technology 
firms. In what has been called the “Brussels 
effect,” the EU aims to use its regulatory
oversight to expand its influence beyond its 
immediate territory. 

Meanwhile, the United States projects 
soft power through “functional sover- 
eignty” mediated by its dominant technol- 
ogy and social media companies. Within 
the US itself there has historically been 
minimal government control over data 
flow owing to the country’s concentration 
of corporate powers, its expansive speech 
protections, the nature of its intellectual 
property laws, its relatively weak privacy 
laws, its ineffective domestic regulatory 
bodies, and the severe limitations placed 
on corporate regulation imposed by inter- 
national groups. 

Unencumbered by governance, private 
US corporations that make the market and 
manage data can thus also promote and 
censor information as they see fit, setting 
up what philosopher Luciano Floridi has 
referred to as a clash “between companies 
and states” (2). Growing recognition of this 
problem has led to bipartisan support for 
the idea that the internet ought to be more 
heavily regulated before it undermines 
democracy. Although the book mentions 
some of the legislative efforts that have 
been proposed, given the speed of change 
in this area, this analysis is already some- 
what dated. 

Individual citizens ought to also have 
control over their personal data (“self- 
sovereignty”). However, this control, when it 
is granted and ostensibly managed through 
consent and oversight, is frequently illusory 
because even seemingly insignificant bits of 
information can be used to deduce highly 
private details as a result of advances in 
data processing and artificial intelligence. 

Ultimately, this volume provides an in- 
sightful exploration of data sovereignty, 
shedding light on the multifaceted chal- 
lenges and opportunities that lie ahead in 
the evolving landscape of data governance. 

Making a digital dupe of Anderson Cooper


Two computer scientists discuss deepfakes and how they made a facsimile of the anchor 


By Jamie Dickman 


On 3 December 2023, CNN aired an episode of The Whole Story
with Anderson Cooper that featured a computer-generated fac- 
simile of Cooper. University of California, Berkeley, computer 
scientist Hany Farid and Stanford University undergraduate 
Maty Bohacek worked with CNN to create the virtual news an- 
chor. The project began as an exercise in learning how deep- 
fakes—digitally created likenesses—could be assembled, but the pair 
quickly realized that they had all the resources necessary to render a 
fully synthetic Cooper. They completed the project within the span of 
2 months using primarily open-source software. The final product’s
realism has worrying implications for the trustworthiness of visual
media and television, not to mention the livelihoods of onscreen per- 
formers, the latter of which has been a point of contention in recent 
Hollywood strikes. 

“My lab here at Berkeley is not in the business of creating fakes,” 
said Farid, a cybersecurity expert. “We’re in the business of detecting 
them, and if you want to be in the business of detecting stuff, you’ve 
got to know how [it’s] built.” We spoke with Farid and Bohacek about
the project and the broader phenomenon of generative artificial in- 
telligence (AI). This interview has been edited for brevity and clarity. 


Q: How did this project get started? 
Farid: I did a piece with Donie 
O’Sullivan, who is a reporter for 
CNN and works on the Anderson 
Cooper show, where we cloned his 
[Donie’s] voice and then we called 
his parents with this AI voice, and 
they couldn’t tell. 

We were screwing around clon- 
ing people’s voices, and we cloned 
Anderson Cooper’s voice. And at 
some point in that conversation, I 
said, “I think we might be getting 
close to having all the pieces we need 
to create a full-blown version of a 
virtual Anderson Cooper.” 


Q: Can you walk me through how you 
create a video like this? 
Farid: So, there are actually three 
parts: the text, the audio, and the 
video. They wouldn’t let us generate 
a script for Anderson Cooper. That’s 
not surprising, right? The guy’s on national 
television. So, the first thing we’re going to 
do is generate Anderson Cooper saying [the 
CNN script]. That’s all open source code. 
Bohacek: [To create the video, we need to] 
identify where the head is in the footage, 
and then re-render the lower area of the 
face, so that the mouth opens and closes in 
the way that the words are spoken in the 
audio. You need to do a little bit of tweak- 
ing here and there, but the problem with 
most of these technologies is that they’re 
very low-res. 

The second part is making it look
sharper and enhancing the frames in the
video that are already synced with the audio.
We do some preprocessing, some postprocessing,
but that’s basically it.

Computer scientists created a virtual likeness of Anderson Cooper,
who is shown here at a 2016 Presidential Primary Debate.

The interviewer is at the Medill School of Journalism,


Q: How much expertise or training would 
someone need to do this? 

Bohacek: It was a lot of time and a lot of 
effort, but it was just putting the pieces 
together. I think that anyone with decent 
Python skills and the dedication could do 
that at some point. 

Farid: I think right now you need some- 
body like Maty; he’s got some good engi- 
neering skills and good coding skills. He’s 
got years of programming skills. But here’s 
the thing: About every 6 to 12 months, it 
gets twice as good, twice as cheap, and 
twice as easy to use. So, even if the average 
Joe can’t do it today, they’re going to even- 
tually be able to do it. 


Q: Are you concerned that someone 
could use what they’ve learned from 
the segment to create AI performers for 
nefarious reasons? What steps did you 
take to prevent this? 

Farid: We went back and forth on 
“Should we publish how we did this?” 
We landed on “no.” We're not going 
to tell people the series of things that 
we had to do to get this to really work 
because we felt like the downside 
outweighed the benefits. There was 
no real upside. I don’t think it’s like 
we know something that nobody will 
ever figure out. [But as] anybody who 
does cybersecurity will tell you: You 
find a vulnerability, you need to tell 
the world. 


Q: What responsibilities do technolo- 
gists have to ensure that this technol- 
ogy is deployed ethically? 

Farid: Physicists, chemists, biologists— 
they’ve been dealing with this stuff forever. 
They have had time to think about the ethics 
of stem cell research, of cloning, of develop- 
ing things that can be weaponized. Our com- 
munity doesn’t have that culture. That’s why 
I think we are struggling a little bit. 

If you look at the early days of the in- 
ternet, it was very much this libertarian 
view that technology wants to be free and 
that regulation and government [should 
be] hands-off. And I think that’s a spec- 
tacularly naive way of thinking about the 
world. Yet, we do that with technology 
all the time because we have this discon- 
nect between technology—bits, pixels, and 
bytes—and the physical world. There is no 
more online world and offline world; that 
distinction vanished a long time ago. 


Iran’s Zayandeh Rud River 
basin in crisis 

The Zayandeh Rud River, the largest river 
in Iran’s central plateau (1), has held strategic 
importance for centuries because of 
its location. Although the river was added 
to Iran’s natural heritage list in 2011 (2), for 
the past two decades, imprudent actions by 
the government and residents have led to 
environmental and management problems 
in the river’s basin (3). Between 1998 and 
2018, the Zayandeh Rud River’s average 
flow decreased by 1 million cubic meters 
(4). Given that Iran, situated in a semiarid 
region, consistently grapples with water 
scarcity, the government should work to pro- 
tect its most important river. 

The Zayandeh Rud’s precarious state has 
resulted from unsustainable water use in 
the region. Unauthorized upstream pump 
installations, particularly in the past decade, 
have led to indiscriminate water extraction 
by local residents, triggering water crises (5). 
Downstream, water-rich crops, expansive 
farming in unsuitable conditions, and the 
absence of modern agricultural technologies 
have intensified the problem (6). Meanwhile, 
water-intensive steel industries and oil 
refineries divert water resources. Water 
mismanagement, exacerbated by inadequate 
governance, a lack of monitoring, and a 
shortage of skilled professionals, has pushed 
the country to the brink of crisis. 

Iran has breached its Ramsar Convention 
obligations (7) by allowing unauthorized 
water diversions and violating regulations 
that protect wetlands. These activities have 
caused 90% of the Zayandeh Rud to dry up 
(8). This extreme desiccation has resulted in 
the Gavkhooni wetland drying up, ecosystem 
disruption, soil degradation, and decreases 
in water and air quality (9). 

Iran has taken some steps to protect the 
river and basin. The government established 
the Zayandeh Rud River Quality Protection 
Council (10) to prevent pollution and 
improve the management and water qual- 
ity of the river, and it has enacted laws that 
require the fair distribution of water and 
outline a nine-point plan for basin restora- 
tion (7). However, these measures are inad- 
equate. In alignment with environmental 
rights principles (12), addressing the 
challenges in the Zayandeh Rud water basin will 
involve prioritizing the environment, pre- 
venting illegal upstream harvesting, ensur- 
ing transparency in water usage across the 
basin, focusing on land use and integrated 
management, transitioning from traditional 
to modern and greenhouse agriculture, 
reforming water governance, and transfer- 
ring high-demand water industries to other 
cities. Implementing these measures would 
effectively address the complex and critical 
situation in the Zayandeh Rud water basin. 
Iman Salahshoori and Majid Namayandeh 


A man walks along a dried-up Zayandeh 
Rud River riverbed in Isfahan, Iran. 


Restore the Yangtze 
River’s floodplains 


In 2020, China implemented a 10-year fish- 
ing ban in the Yangtze River, a substantial 
aquatic biodiversity conservation initiative 
(1). Although the ban has offered protec- 
tion to Yangtze species, habitat destruction 
prevents China from maximizing its benefits 
(2). The loss of aquatic habitats, especially 
floodplains, is now a more substantial driver 
of biodiversity loss in the Yangtze River than 
illegal hunting and overfishing (3). 

Both fish stock and biodiversity have 
shown signs of recovery in the years since 
the ban took effect (2). In 2022, 25 fish spe- 
cies were collected that were not found in 
2020, and overall fish density in the Yangtze 
River had increased by 20% (4). Populations 
of the iconic and endangered Yangtze fin- 
less porpoise (Neophocaena asiaeorienta- 
lis asiaeorientalis) have experienced an 
unprecedented increase from 1012 in 2017 to 
1249 in 2022 (4). 

Floodplains—seasonally inundated areas 
along the river and connected lakes—are 
invaluable seasonal breeding and spawn- 
ing habitats for aquatic species (5), espe- 
cially for the Yangtze finless porpoise (6). 
However, anthropogenic activities such 
as dam construction, urbanization, and 
agricultural expansion have led to rapid 
floodplain loss. Some floodplains have 
become permanently dry, some are now 
permanently underwater but inaccessible 
to breeding species, and some have been 
lost entirely through erosion. The area of 
seasonal floodplains in the main stem of 
the Yangtze River has shrunk to less than 
3.3% of the floodplain area that existed in 
the 1970s (7, 8). Maintaining the population 
size of fish in the Yangtze River requires a 
floodplain area of 14,400 km², but the total 
current area is only about 5500 km² (9). 
The construction and operation of cascad- 
ing reservoirs in the upper reaches of the 
Yangtze River could further exacerbate the 
decline of Yangtze seasonal floodplains (2). 

The 10-year fishing ban is an important 
step for the ecological restoration of the 
Yangtze River, but its effectiveness depends 
on the restoration of seasonal floodplains. 
In the middle and lower reaches of the 
Yangtze River, the central government 
has built embankments on each side, and 
local governments have built additional 
“sub”-embankments, as well as floodgates 
between the river and connected lakes. 

The function of sub-embankments and 
gates was to control flooding, but upstream 
dams and the primary embankment now 
serve this purpose. Demolishing the sub- 
embankments and gates would allow 
floodwaters to flow freely within the main 
embankments, which would restore the 
floodplains and eliminate human develop- 
ment activities without increasing the risk 
of flooding beyond the river. The Yangtze 
River Protection Law, the country’s first 
legislation on a river basin, went into effect 
in early 2021. The river’s shoreline is strictly 
protected under the law, laying the founda- 
tion for floodplain recovery (10). 

Zhigang Mei, Kexiong Wang, Hao Du, Peilin 


Pakistan’s turtle species 
at risk of extinction 


Pakistan’s five species of marine turtles, 
eight species of freshwater turtles, and two 
species of land turtles face numerous threats 
(1). The Indian narrow-headed softshell 
turtle (Chitra indica) is listed as Endangered 
under Appendix II of the Convention on 
International Trade in Endangered Species 
(CITES) (2). Olive ridleys (Lepidochelys 
olivacea), once common, have not been 
observed nesting in Pakistan since 2004 (7). 
If Pakistan does not take steps to mitigate 
the risks to wild turtles, they could face 
imminent extinction. 

Human activities have degraded turtle 
habitats. Unchecked urban growth and 
increasing pollution have led to habitat 
loss (1). Rapid coastal development has 
encroached on once pristine sea turtle nest- 
ing sites (3). Climate change also threatens 
turtle habitats through increasing tempera- 
tures and rising sea levels (4). 

Fishing poses another threat to Pakistan’s 
turtles. Approximately 30,000 sea turtles 
are accidentally entangled in tuna gillnets 
each year (5). Although most turtles in the 
region are herbivorous, and others feed on 
insects, crustaceans, and very small fish 
(6), some fishers kill turtles intentionally 
because they incorrectly believe that turtles 
prey on large fish (7). 


The illegal turtle trade from Pakistan to 
China and East Asia has led to substantial 
decreases in wild turtle populations (8). 
Demand for turtles is driven by their cul- 
tural associations with luck and longevity 
(9). They are kept as pets and consumed as 
food and as components of traditional medi- 
cine. In 2015, five confiscated shipments 
included 1.9 tons of turtle body parts and 
1345 live freshwater turtles (10). 

Turtle conservation requires the collec- 
tive efforts of governments, conservation 
groups, and local communities. Community- 
based initiatives, sustainable practices, and 
increased awareness about the impact of cli- 
mate change and pollution on turtle habitats 
could mitigate further habitat loss. Because 
many turtles are found in surface waters 
(11), fishermen should switch from surface to 
subsurface gillnetting to reduce turtle entan- 
glement. Educating the public about turtle 
conservation could reduce intentional culls. 

Although exporting turtles is forbid- 
den according to CITES (12), smuggling is 
enabled by lucrative rewards and Pakistan’s 
weak laws and lenient penalties. To thwart 
trade, Pakistan must strictly enforce the 
CITES ban on turtle export and implement 
strict anti-wildlife trade measures. The government 
should station scent-detection dogs at 
airport and seaport exit points to curb trade 
in smuggled turtles. Because Pakistan 
lacks reliable turtle data, the government 
should also conduct a systematic survey 
to lay the foundation for conservation and 
future research. 


TENDINOPATHY 
Tendon repair gets up its nerve 


Tendons have peripheral nerve inputs for sensory feedback; 
however, the role of these nerves in tendon injury and repair 
remains unclear. Cherief et al. demonstrated that an 
increase in peripheral nerve growth occurs after Achilles 
tendon injury in mice. Experimental blockade of nerve 
growth resulted in decreased tendon healing and reduced 
expansion of tendon sheath progenitor cells. Human 
tendon tissue imaging and transcriptomics corroborated 
the relationship between nerve growth 
and tendon healing. Further, activating the 
nerve growth factor receptor TrkA enhanced 
nerve responses to tendon injury, boosted 
progenitor cell expansion, and improved 
metrics of tendon repair. —MEO 


Sci. Transl. Med. (2023) 
10.1126/scitranslmed.ade4619 


ASTROCHEMISTRY 
Ryugu PAHs formed 
in interstellar clouds 


Astronomical observations 
show that polycyclic aromatic 
hydrocarbons (PAHs) are 
abundant and widespread in 
the interstellar medium. PAH 
molecules consist of several 
adjacent aromatic rings termi- 
nated by hydrogens. Zeichner et 
al. performed laboratory isoto- 
pic analysis of PAHs in samples 
of the asteroid Ryugu that were 
collected by the Hayabusa2 
spacecraft. They found that two 
¹³C nuclei occurred in the same 
molecule more often than would 
be expected if ¹³C were randomly 
incorporated into PAHs. 


This isotope fractionation effect 


occurred during formation at 
temperatures under 100 kelvin. 
The authors argue that at least 
some of the Ryugu PAHs must 
have formed in cold interstellar 
clouds and thus predate the 
Solar System. —KTS 


Science, adg6304, this issue p. 1411 


SUPERCONDUCTIVITY 
Twisted superconductors 


Twisted two-dimensional (2D) 
structures exhibit a wealth of 
interesting behaviors. Recently, 
the 2D materials used in such 


superlattices have largely been 
graphene and various transition 
metal dichalcogenides. However, 
cuprate superconductors can 
also be thought of as 2D materi- 
als, with the copper-oxide planes 
coupled to each other through 
Josephson coupling. Zhao et 

al. created high-quality twisted 
structures of a bismuth-based 
cuprate by cleaving a single 
exfoliated crystal and placing 
the two halves on top of each 
other at various angles. Near the 
twist angle of 45°, Josephson 
coupling was suppressed by the 
d-wave superconducting order 
parameter of the material. The 
researchers also observed a 
Josephson diode effect caused 
by the breaking of time reversal 
symmetry. —JS 

Science, abl8371, this issue p. 1422 


Healing of injuries 
to the Achilles tendon, 
shown in a computed 
tomography image, involves 
increased growth 
of peripheral nerves. 


SOLAR CELLS 
Optimizing nickel 
oxide transporters 
Inverted perovskite solar 
cells have advantages in 
manufacturing, but their buried 
hole-transport layer (HTL) 
often limits power conver- 
sion efficiency (PCE). Yu et al. 
showed that treatment of a 
nickel oxide nanoparticle HTL 
with hydrogen peroxide made 
the particles smaller and more 
uniform and also created Ni³⁺ 
surface groups that improved 
bonding to organic phosphonic 
acid self-assembled monolayers. 
A device with 25.2% certified 
PCE was fabricated and retained 
85% of its initial efficiency after 
500 hours of accelerated aging 
at 85°C. —PDS 

Science, adj8858, this issue p. 1399 


An asymmetric kinase 
complex 


Mutations in leucine-rich 
repeat kinase 2 (LRRK2) are 
implicated in the development 
of Parkinson's disease, so this 
enzyme is a target for therapeu- 
tics currently in clinical trials. 
Zhu et al. determined the struc- 
tures of LRRK2 bound to Rab29, 
a membrane-associated protein 
thought to target the kinase to 
specific organelles, and found 
that the resulting complexes 
adopted several oligomeriza- 
tion states. The largest complex 
included four copies of LRRK2 in 
two different conformations: one 
was similar to the inactive state 
seen previously and the other 
represented a proposed active 
state. Functional experiments 
and work with mutants pro- 
vide insight into the structural 
features of this complex and 
may inspire new strategies for 
therapeutic inhibition. —MAF 
Science, adi9926, this issue p. 1404 


Fast and easy 

The numerical models used 
to predict weather are large, 
complex, and computationally 
demanding; they also do 
not learn from past weather 
patterns. Lam et al. introduced 
a machine learning-based 
method that has been trained 
directly from reanalysis data of 
past atmospheric conditions. In 
this way, the authors were able 
to quickly predict hundreds of 
weather variables globally up to 
10 days in advance and at high 
resolution. Their predictions 
were more accurate than those 
of traditional weather models 
in 90% of tested cases and 
displayed better severe event 
prediction for tropical cyclones, 
atmospheric rivers, and extreme 
temperatures. —HJS 

Science, adi2336, this issue p. 1416 


Inspired by polar bears 
Polar bears have fur that has 
a porous core encapsulated in 
a dense shell, a structure that 
keeps these animals warm and 
dry but is relatively lightweight. 
Wu et al. were inspired by this 
structure in developing an 
encapsulated aerogel fiber (see 
the Perspective by Sheng and 
Zhang). The synthetic fibers 
have mechanical properties that 
allow for weaving and knitting 
while at the same time being 
excellent thermal insulators. 
They can also be produced at 
scale with a simple two-step 
freeze-spinning and encapsula- 
tion process. —BG 

Science, adj8013, this issue p. 1379; 
see also adm8388, p. 1358 


An immune atlas of 
developing human lungs 


From birth, the airways provide 
protection against respiratory 
pathogens and inhaled toxins, 
but little is known about the 
early development of lung 
immune cells. Using single-cell 
transcriptomics, Barnes et al. 
characterized human embryonic 
and fetal immune cells in the 
developing lungs between 5 and 
22 weeks after conception. All 
stages of B cell development 
were detected, suggesting that 
fetal lungs provide a local niche 
for B cell maturation. Myeloid 
cells were widespread, includ- 
ing near epithelial tips, and 
produced interleukin-1β, which 
induced epithelial stem cell 
differentiation into basal cells 
within fetal lung organoids. This 
work provides an immune atlas 
of developing human lungs and 
suggests a role for fetal immune 
cells in guiding the development 
of the lung epithelium. —CO 
Sci. Immunol. (2023) 
10.1126/sciimmunol.adf9988 


It takes a neighborhood 
to raise a voter 


Chyn and Haggag studied 
nearly 6000 children living in 
low-income public housing in 
Chicago. The families of one-third 
of them were forced to relocate 
when their buildings were slated 
for demolition during 1995 to 1998 
and received government support 
to move to higher-income, 
lower-crime areas. Relocated children were 
10% more likely to vote in general 
elections during 2008 to 2016 
compared with their nonrelo- 
cated counterparts. Results were 
driven largely by females and by 
those who relocated at earlier 
ages (5 to 11 years old) and 


appear to be linked to increased 
education and earnings among 
those who relocated. —BW 
Rev. Econ. Stat. (2023) 
10.1162/rest_a_01207 


Tau moves vesicles 
Neuronal communication relies 
on release of neurotransmitter- 
containing synaptic vesicles 
(SVs) from the presynaptic 
terminals. The different mobility 
of the two major types of SVs 

is critical for correct neuronal 
communication, but how this 
happens is unknown. Longfield 
et al. have found that the protein 
tau, known for its role in neurode- 
generative diseases, controls the 
mobility of SVs. Tau molecules 
form transient nanobiomolecular 
condensates at presynaptic 
terminals, which selectively 
regulate the mobility of SVs. 
Thus, tau dysregulation during 
neurodegeneration might affect 
SV mobility and impair synaptic 
transmission. —MMa 
Nat. Commun. (2023) 
10.1038/s41467-023-43130-4 


Accelerating 
photoreaction discovery 


In recent years, efficient high-throughput 
screening methods 
have been developed that 
should substantially accelerate 
the rate of discoveries in various 
fields of organic chemistry. 
However, organic photochemistry 
lags due to challenges 
associated with excited-state 
chemical reactions. Bain et 
al. present a system for rapid 
screening of photochemical 
systems using transient 
absorption spectroscopy, as 
demonstrated using pyrimidine 
nucleosides. The proposed 
framework is general and can be 
extended to other excited-state 
spectroscopic techniques. It 
offers a promising experimental 
tool for photochemical reaction 
discovery, complementing 
recently proposed machine 
learning-based computational 
tools for modeling excited-state 
dynamics. —YS 
J. Phys. Chem. Lett. (2023) 
10.1021/acs.jpclett.3c02389 


What's that growing 
under the bed? 


Mushrooms that produce the psychoactive 
compound psilocybin 
have been cultivated for decades, 
although the unlawful nature of their 
use has resulted in the origins of 
many strains being obscured. McTaggart et 
al. sequenced wild populations of Psilocybe 
cubensis in Australia and compared them 
with 86 sequenced commercial cultivars. P. 
cubensis seems to have been introduced to 
Australia with livestock in the 19th century, 
where it grew abundantly on dung and was 
accidentally foraged for food. Now, naturalized 
populations show far greater diversity 
compared with cultivars, which show evidence 
of genetic bottlenecks within the past 
50 years and may not have sufficient fitness 
to thrive outside of cultivation. However, 
there is little evidence that domestication 
has improved the psychedelic effects 
of P. cubensis; rather, it has made them 
easier to grow in basements or under beds. 
Psilocybin now is finding legitimate uses for 
treating serious mental health disorders, 
and its cultivation is thus of considerable 
commercial interest. —CNS and CA 

Curr. Biol. (2023) 
10.1016/j.cub.2023.10.059 


Spores of P. cubensis (seen here in “psychedelic” 
colors) have been collected for clandestine magic 
mushroom cultivation over the past 50 years. 


Putting more omics 
together 


Recent years have seen major 
advances in the amount of 
information that can be gleaned 
from genomic testing of tissue 
samples. Single-cell genomic 
methods were invented and 
then refined or combined into 
multi-omics analyses featuring 
different types of data. Yang et 
al. present the next advance in 
this progression, with a compu- 
tational method called Spatrio, 
which combines single-cell 
multi-omics with spatial data. 
This method is likely to yield 
valuable insights into several biological 
processes, as the authors 
demonstrate by examining 
tissues as different as those in 
the brain, liver, and breast, some 
in a healthy state and others in 
diseases such as steatosis or 
cancer. —YN 
Cell Genom. (2023) 
10.1016/j.xgen.2023.100446 


Earth’s early crustal 
water source 


Archean cratons are what 
remains of the very first conti- 
nents formed on Earth. However, 
forming these continents 
requires the proper chemistry 
and dynamics, with a water 
source in the mantle being 
one of the hardest problems 
to solve. Wu et al. hypothesize 
that a water-rich magma ocean 
may have overturned after the 
moon-forming impact, sup- 
plying sufficient water to the 
lithosphere to generate Earth's 
earliest continental crust. This 
hypothesis could resolve the 
question of the source of the 
water while also explaining other 
features of very early crustal 
growth. —BG 

Geophys. Res. Lett. (2023) 

10.1029/2023GL105178 


One anchor to bind 
them all 


Classically, malaria is diag- 
nosed microscopically in blood 
samples. Despite its familiarity 
and size, malaria parasite cell 
biology is not well understood. 
Liffner et al. used ultrastructural 
expansion light microscopy to 
prepare physically enlarged 
samples in which immunofluo- 
rescent markers can be used to 
explore parasite anatomy. This 
technique revealed that during 
mitosis, developing organelles 
such as the rhoptries (specialist 
structures required for host cell 
invasion), Golgi, and prefission 
mitochondria anchor to the 
parasite’s plasma membrane by 
means of the centriole plaque 
and its associated proteins. 
—CA 
eLife (2023) 
10.7554/eLife.88088.2 


BITCOIN 
Cryptocurrency 
skepticism 
After El Salvador adopted the 
cryptocurrency bitcoin as legal 
tender, its government incen- 
tivized it with the Chivo Wallet 
app, a digital wallet backed by 
their central banking system. 
Despite major incentives, bitcoin 
was rarely adopted except 
among one privileged group: 
young, educated men with 
bank accounts. Compared with 
traditional banks, bitcoin offers 
improved privacy and transpar- 
ency and rapid, inexpensive 
transactions. These are all fea- 
tures that economists assumed 
were desirable for Salvadorans 
because most are unbanked, 
need remittances, and can 
access Chivo Wallet because 
they usually own phones with 
Internet. However, Alvarez et 
al. found that Salvadorans’ 
preference for tangible cash and, 
ironically, privacy and transpar- 
ency fears impeded bitcoin 
adoption. Unless populations 
are financially literate and trust 
virtual currencies, policies 
incentivizing their adoption may 
fail. —EEU 

Science, add2844, this issue p. 1375 


NEUROSCIENCE 
Glia and negative memory 
formation 


Fruit flies can be taught to asso- 
ciate specific odors with painful 
electrical shocks. However, how 
these associative memories are 
formed and stored in their brains 
is not fully understood. Miyashita 
et al. unexpectedly discovered 
that ensheathing glia, the cells 
that wrap around the mushroom 
bodies, which are the flies’ 
main neural processing centers, 
release glutamate from synaptic 
vesicles. The glutamate released 
binds to NMDARs on mushroom 
body neurons and kainate receptors 
on dopaminergic neurons. 
NMDARs require both glutamate 
binding and simultaneous 
cellular depolarization to open. 
Functional imaging experiments 
demonstrated that NMDARs 
permit the transmission of shock 
information to the mushroom 
bodies only when flies are 
exposed to an odor and shock 
simultaneously, but not when 
flies are exposed to shocks 
alone. —PRS 

Science, adf7429, this issue p. 1376 


TRANSCRIPTION 
Visualizing transcription 
initiation 
In eukaryotic cells, general 
transcription factors (GTFs) sup- 
port RNA polymerase (Pol) II to 
assemble a preinitiation complex 
on a core promoter, open double-stranded 
DNA, and start RNA 
synthesis. Chen et al. determined 
the structures of 16 de novo 
transcribing complexes, TC2 to 
TC17, containing nascent RNAs of 
2 to 17 nucleotides, respectively. 
Connecting these structures 
revealed an abrupt transi- 
tion from initially transcribing 
complexes (in TC2 to TC9, GTFs 
remained bound to the promoter 
and Pol II) to early elongation 
complexes (in TC10 to TC17, GTFs 
dissociated from the promoter). 
This transition was accompanied 
by remarkable conformational 
and compositional changes, 
including transcription bubble 
collapse and GTF dissociation, 
which allowed Pol II to escape 
the promoter for product RNA 
synthesis. —DJ 

Science, adi5120, this issue p. 1377 


SIGNAL TRANSDUCTION 
Mapping receptor 
pharmacology 


G protein-coupled receptors 
such as the beta-adrenergic 
receptor for adrenaline are key 
targets for therapeutic drug 
development. Heydenreich et al. 
undertook systematic muta- 
tion of each receptor residue to 
determine which are important 
for receptor signaling (see the 
Perspective by Filizola and 
Javitch). The authors evaluated 
the contribution of each residue 
to the fundamental pharma- 
cological properties of efficacy 
and potency of the endogenous 
receptor ligand adrenaline and 
found that one-fifth of resi- 
dues contributed to efficacy or 
potency of signaling by the 
receptor. Some mapped to sites 
of ligand or G protein binding, but 
more than two-thirds did not. The 
important sites appeared to work 
together in an allosteric network 
that led to structural changes 
and signaling responses. Such 
detailed information may help to 
guide the development of new 
ligands that produce the desired 
signaling responses. —LBR 
Science, adh1859, this issue p. 1378; 
see also adm8393, p. 1357 


CHEMISTRY AUTOMATION 


A dye design loop 
As chemists make increasing 
use of automated equipment 
and predictive synthesis algo- 
rithms, an autonomous research 
device is coming closer to real- 
ization. Koscher et al. developed 
a prototype platform for explor- 
ing the prospective dye-like 
properties of several polycyclic 
small-molecule classes. By 
combining machine learning-enabled 
planning and analysis 
with robotic reaction modules, 
the apparatus iteratively syn- 
thesizes variously substituted 
compounds to optimize their 
absorption properties, lipophilic- 
ity, and photo-oxidative stability. 
The authors elaborate on when 
and why human input still comes 
into play. —JSY 

Science, adi1407, this issue p. 1374 


SOCIAL DISPARITIES 
How to motivate 
mitigation efforts 


Racism in America is a system 
of oppression that produces 
and sustains racially unequal 
outcomes across education, the 
economy, health, incarceration, 


and other institutions. These 
intertwined institutions place 
Black Americans at a com- 
pounded disadvantage even 
in the absence of overt dis- 
crimination. An effective way 
to challenge systemic racism is 
through social movements or 
policies. However, Black-White 
disparities in some domains 
(e.g., poverty and alienation) 
may not mobilize resistance 
toward racism as much as 
disparities in other domains 
(e.g., health and physical 
harm), although all domains are 
pernicious and reinforce each 
other. Brown et al. found that 
threats to health or body (e.g., 
aggressive police encounters) 
spark outrage and motivate 
more action because they 
trigger morality concerns (see 
the Perspective by Earl and 
Derricks). Framing disparities to 
engage moral injustice feelings 
may better motivate policy 
reform to dismantle systemic 
racism. —EEU 

Science, adh4262, this issue p. 1394; 
see also adm7199, p. 1362 


ICE SHEETS 
Many-armed record 
of collapse 


How the West Antarctic Ice 
Sheet (WAIS) responded to 
warmer climates in the geologic 
past has obvious relevance 
to our understanding of what 
its future could be as global 
temperatures rise due to human 
activities. Using genetic analy- 
ses of a type of circum-Antarctic 
octopus, Pareledone turqueti, 
Lau et al. showed that the WAIS 
collapsed completely during the 
last interglacial period, when 
global sea levels were 5 to 10 
meters higher than today and 
global average temperatures 
were only about 1°C warmer 
(see the Perspective by Dutton 
and DeConto). The implication 
of this finding is that major 
WAIS collapse and the conse- 
quent rise in sea level could be 
caused even by the minimal 
temperature rises projected for 
stringent climate change mitigation. —HJS 
Science, ade0664, this issue p. 1384; 
see also adm6957, p. 1356 


NEUROSCIENCE 
Excitatory receptor type 
also inhibits 


Glutamate delta receptors 
look like ionotropic glutamate 
channels but do not behave 
like classic neurotransmitter 
receptors because of their 
insensitivity to glutamate and 
inability to gate the ion channel 
pore upon ligand binding. Piot 
et al. found that GluD1, the 
least characterized member 
of this receptor family, instead 
binds GABA, the main inhibitory 
neurotransmitter in vertebrates 
(see the Perspective by Coombs 
and Farrant). GABA binds to the 
ligand-binding domain of GluD1 
and drives conformational 
rearrangements in the receptor. 
GluD1 receptors localize at hip- 
pocampal GABAergic synapses, 
where they regulate the efficacy 
of GABAergic transmission. 
Activation of GluD1 receptors 
at hippocampal GABAergic 
synapses induces inhibitory 
synaptic plasticity. —PRS 
Science, adf3406, this issue p. 1389; 
see also adm6771, p. 1363 


DNA DAMAGE 
Surviving DNA damage 
with IRAKs 


The kinase IRAK1 promotes 
radiotherapy resistance in 
cancers. Li et al. found that 
this function of IRAK1 was 
independent of its regulation 
by innate immune receptor 
signaling. Double-stranded DNA 
breaks induced by radiation 
triggered IRAK4-dependent 
activation and nuclear translo- 
cation of IRAK1, which bound to 
and inhibited a pro-apoptotic 
complex, thereby supporting 
cell survival. This mechanism 
required the DNA damage 
response kinase ATR and 
Pellino family ubiquitin ligases. 
Disrupting the pathway sensi- 
tized p53-mutant zebrafish to 
radiation. —LKF 
Sci. Signal. (2023) 
10.1126/scisignal.adh3449 


CHEMISTRY AUTOMATION 


Autonomous, multiproperty-driven molecular 
discovery: From predictions to measurements and back 


Brent A. Koscher†, Richard B. Canty†, Matthew A. McDonald†, Kevin P. Greenman, Charles J. McGill, 
Camille L. Bilodeau, Wengong Jin, Haoyang Wu, Florence H. Vermeire, Brooke Jin, Travis Hart, 
Timothy Kulesza, Shih-Cheng Li, Tommi S. Jaakkola, Regina Barzilay, Rafael Gómez-Bombarelli, 
William H. Green, Klavs F. Jensen* 


INTRODUCTION: The discovery of small mole- 
cules with desired functional properties is 
critical to advances in health, energy, and sus- 
tainability. The process is conducted through 
often slow, laborious, iterative design-make- 
test-analyze (DMTA) cycles. Emerging machine 
learning (ML) tools can generate new candi- 
date molecules, predict their properties, and 
propose reaction pathways with computer- 
aided synthesis planning (CASP). Progress in 
chemical automation can enable chemical syn- 
thesis and characterization with minimal hu- 
man intervention after a manual setup. For 
example, automated feedback-guided chem- 
istry platforms have demonstrated chemical 
optimization within narrow, well-defined classes 
of reactions. 


RATIONALE: Integrating ML generative algo- 
rithms, ML property prediction, CASP, robotics, 
and automated chemical synthesis, purification, 
and characterization into a DMTA workflow 
can lead to the development of autonomous 
chemical discovery platforms able to operate 
across varied chemical spaces without manual 
reconfiguration. An ideal property-focused dis- 
covery platform would propose and synthesize 
molecules to enrich ML generative and property 
models and ultimately discover top-performing 
molecules. In reality, it is necessary to exclude 
reactions that the available automation hard- 
ware cannot safely execute. 


RESULTS: Toward autonomous discovery, we 
demonstrate an integrated DMTA cycle that 
iteratively proposes, realizes, and characterizes 
molecules to explore chemical spaces guided 
only by prediction tools. Four molecular scaf- 
folds with potential applications as organic 
dyes serve as exploration case studies, whereas 
a fifth scaffold demonstrates the platform 
exploiting refined ML models to optimize dye 
properties. Dye-like molecules constitute a compelling 
test case as fundamental properties 
are readily measurable and their realization 
involves multistep synthesis by a wide variety of 
chemical transformations. The platform automatically 
characterized three target properties, namely 
absorption maximum (λmax), octanol-water partition 
coefficient (logKow), and photooxidative 
stability (logkdeg), and used them to refine 
model predictions and inform future experiment 
selection. 

Autonomous molecular discovery. A suite of machine learning tools iteratively plans experiments to learn 
about a structure-property space defined by molecular dye scaffolds. εexp, mean experimental error. 

A graph-completion generative model designs 
candidate molecules, which are evaluated with 
an ML model for each of the three properties. 
A CASP tool proposes multistep synthetic rec- 
ipes, which are executed by an automated liq- 
uid handler, batch reactors, high-performance 
liquid chromatography (HPLC), and robotic 
arms. A plate reader measures absorption spec- 
tra, calibrated HPLC retention times provide 
water-octanol partition coefficients, and a 
simulated solar light source combined with 
the plate reader quantifies photooxidative deg- 
radation. The measured molecular properties 
are automatically fed back to retrain the prop- 
erty prediction models, completing one step of 
the automated DMTA cycle. Initially, the plat- 
form is highly uncertain about each scaffold 
(i.e., the ML prediction models have signifi- 
cant model errors), so it chooses to realize the 
simplest derivatives to anchor property pre- 
dictions. Subsequent iterations favor more 
synthetically complex derivatives to fill gaps 
in the learned chemical space. Three iterations 
were sufficient for the ML model deviations to 
approach experimental uncertainty and en- 
able exploitation. Human involvement was 
limited to setting and adjusting objectives, 
providing requested materials, and occasion- 
ally fixing unrecoverable errors such as clog- 
ging of the HPLC unit. 


CONCLUSION: We have demonstrated a DMTA 
cycle that explores chemical space and exploits 
known chemical structures without manual 
experimentation. The platform proposed, syn- 
thesized, and characterized 303 unreported 
dye-like molecules spread across four explo- 
ration scaffolds and one exploitation scaffold. 
The integrated platform can explore unknown 
structure-property spaces by searching for 
structures with desired properties (hits) and 
can exploit characterized structure-property 
spaces by optimizing structures of promising 
candidates (leads). Further development of 
closed-loop integrated platforms promises to 
accelerate molecular discovery. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: kfjensen@mit.edu 
A closed-loop, autonomous molecular discovery platform driven by integrated machine learning tools 
was developed to accelerate the design of molecules with desired properties. We demonstrated 
two case studies on dye-like molecules, targeting absorption wavelength, lipophilicity, and photooxidative 
stability. In the first study, the platform experimentally realized 294 unreported molecules across three 
automatic iterations of molecular design-make-test-analyze cycles while exploring the structure-function 
space of four rarely reported scaffolds. In each iteration, the property prediction models that guided 
exploration learned the structure-property space of diverse scaffold derivatives, which were realized 
with multistep syntheses and a variety of reactions. The second study exploited property models trained 
on the explored chemical space and previously reported molecules to discover nine top-performing 
molecules within a lightly explored structure-property space. 


The efficient prediction, realization, and 
validation of small molecules, polymers, 
and materials with desirable functional 
properties is needed to accelerate advances 
in medicine (1, 2), materials (3-5), 
and sustainability (6, 7). This work focuses on 
organic small molecules that must manifest 
multiple distinct properties simultaneously, 
constraining molecular design and increasing 
structural complexity. Emerging predictive tools 
can quickly generate new candidate molecules 
(8-13), predict the performance of candidates 
(11, 14-16), and propose practical reaction path- 
ways (17-21); meanwhile, chemical automation 
can now dependably conduct experiments with 
minimal human intervention after an initial 
setup phase. Integrating generative algorithms, 
computer-aided synthesis planning (CASP), it- 
eratively updated large datasets, and automated 
chemical synthesis, purification, and charac- 
terization for each step of the design-make-test- 
analyze (DMTA) cycle all into a single workflow 
could improve experiment efficiency and ulti- 
mately enable autonomous chemical discovery. 
Automated feedback-guided chemistry plat- 
forms have demonstrated molecular chemical 
discovery and optimization within narrow 
classes of reactions, performing iterative cross-coupling 
reactions (8, 22), rediscovering reactivity (23), 
optimizing single reactions (24-26), 
and driving enzyme-assisted carbohydrate syn- 
thesis (27). Although these demonstrations high- 
light the potential of integrating predictive tools 
and chemical automation, their specialized im- 
plementations restrict the types of chemistry 
executed, the types of structures that can be 
realized, and notably, the chemical spaces that 
can be explored by these platforms. 
Currently, there is a gap between the general 
nature of chemical prediction tools (retrosynthesis 
planning and property prediction) 
and the specialized nature of existing chemical 
automation platforms for the synthesis of small 
molecules. Closing this gap can lead to the 
development of autonomous chemical discov- 
ery platforms that can pivot between different 
discovery spaces without the need for manual 
reconfiguration. In the idealized case, a mo- 
lecular property-focused discovery platform 
would propose and evaluate synthesizable mol- 
ecules by using all known reactions to avoid 
constraints beyond those imposed by multi- 
property spaces. This work narrows the gap 
by demonstrating an integrated platform to 
iteratively propose, realize, and validate mol- 
ecules to explore chemical spaces guided only 
by prediction tools. In practice, a minimal set of 
reactions that cannot be safely executed by the 
selected automation hardware and engineering 
controls must be excluded. The platform in- 
tegrates general chemical tools to automati- 
cally propose diverse molecules, plan synthetic 
pathways, execute multistep reaction pathways, 
purify target molecules, and characterize the 
performance of isolated molecules to learn 
about unfamiliar chemical spaces (Fig. 1A). 
By weighing potential challenges with each 
workflow in the DMTA cycle, the platform holistically 
evaluates the utility of a datapoint to 
the models against the experimental complex- 
ities required to realize a proposed molecule. 
Although this is analogous to how a trained 
chemist might approach chemical discovery, 
the autonomous platform can perform this eval- 
uation on multiple properties and multiple 
experimental plans for thousands to millions 
of molecules simultaneously. As a result of this 
holistic consideration, the platform chooses 
unadorned structures to probe unexplored 
chemical spaces and builds more complex 
derivatives to optimize the properties of fam- 
iliar spaces. 

To initiate the automated DMTA cycle, can- 
didate structures can be generated by using 
molecular generation workflows based on ge- 
netic algorithms (8, 9), reinforcement learning 
(10-12), or conditional generation (13, 28). Plau- 
sible pathways to realize candidates can be 
proposed by using retrosynthetic planning 
packages, such as ASKCOS (17, 18), Synthia 
(19, 29), IBM RXN (20), and AiZynthFinder 
(21). The properties of candidates can be pre- 
dicted with quantitative structure-activity rela- 
tionship models or statistical models such as 
Chemprop, a message-passing neural network 
(6). Molecules can be realized with advances 
in chemical automation such as reconfigurable 
high-throughput platforms (18, 24, 30) 
equipped with hardware to access diverse re- 
action conditions (37) and molecular property 
characterization (7, 32). Such platforms have 
been used to discover novel chemical transfor- 
mations (32, 33), optimize reactions (34, 35), 
and develop functional materials (7, 26, 33). 
This work integrates these tools to demon- 
strate a platform that can autonomously drive 
molecular discovery. 

We demonstrated the platform (Fig. 1B) in 
two molecular discovery use cases for small 
molecule organic dyes: (i) exploration of un- 
known chemical spaces and (ii) exploitation 
of a known chemical space. The platform exe- 
cutes and automatically adjusts workflows 
as needed, with human intervention being lim- 
ited to managerial (error recovery) and cus- 
todial (providing consumables) actions, both 
during and between iterations. A complete dis- 
cussion on required human action is provided 
in the supplementary materials (SM8). Plat- 
form flexibility is crucial to execute multistep 
reaction pathways containing the breadth of 
reaction classes (Fig. 1C) needed to realize can- 
didates covering the desired property space. 
We validated the platform’s capabilities by 
targeting molecular dyes, a model system with 
diverse chemistry and complex molecular prop- 
erties. During two case studies, the platform 
attempted over 3000 reactions, with more than 
1000 yielding the predicted reaction product, 
completing multistep reaction pathways for 
303 unreported molecules (i.e., without CAS 
registry numbers prior to publication). For 
both case studies, the absorption maximum, 
partition coefficient, and photooxidative stability 
were the targeted properties, and the 
platform automatically measured and recorded 
each property to refine model predictions and 
inform future experiment selection. 

Fig. 1. Overview of the integrated platform and reactions predicted and successfully executed by the platform. (A) Illustrations of the key workflow components of the integrated platform and how each component fits into the DMTA cycle in which the properties tested are wavelength of maximum absorption (λmax), octanol-water partition coefficient (logKow), and rate of photooxidative degradation (logkdeg). (B) Layout of the physical hardware of the experimental platform. (C) Stepwise breakdown of the classes of different 


Front-end predictions 


The platform is tasked with either learning the 
properties of a chemical space of interest or 
realizing top-performing molecules within a 
chemical space containing existing examples. 
Before experiments can be executed, the plat- 
form must develop an experimental plan by 
proposing candidate structures, planning reaction 
pathways and reaction conditions, predicting 
candidate properties, and selecting 
candidates to realize (Fig. 1A, design). These 
prediction tasks are modular and can be ex- 
changed to accommodate different objectives. 

Candidate structures are proposed by using 
a hierarchical graph-completion model (34), 
which takes a molecule scaffold as input and 
completes the molecular graph with learned 
motifs, encoding and decoding full motifs at 
once rather than atom by atom. The model 
was first pretrained on ChEMBL (35) to learn 
general organic chemistry rules and then fine-tuned 
on a curated set of dye molecules (supplementary 
materials, SM2) to complete input 
scaffolds with diverse dye-like structural motifs. 
Graph completion proposes molecules span- 
ning the multiproperty space of dye molecules 
and tends to generate more realistic and fea- 
sible structures compared with other molec- 
ular generation approaches (8-13, 28). Graph 
completion also has the benefit of generating 
diverse derivatives while preserving the input 
scaffold, enabling exploration within a local 
chemical structure family. The technique adds 
common functional groups such as dialkylamino 
or cyano groups, whereas a graph-to-graph trans- 
lation model we tested added polyene chains 
or boron inspired by high-performing cyanine 
and BODIPY dyes, respectively, without conserving 
the central scaffold. 

For each of the generated candidate mole- 
cules, multiple synthesis pathways are automat- 
ically planned and proposed by using ASKCOS 
(17, 18), which makes template-based retrosynthesis 
recommendations with reaction templates 
extracted from large reaction databases. 
Reaction templates are iteratively applied in 
a tree search to propose reaction pathways 
that terminate at available reagents, connect- 
ing available starting materials, chemicals, 
and generated structures. A set of preferred- 
vendor chemical catalogs—containing SMILES, 
package sizes, vendor product IDs, and prices— 
and our in-lab inventory formed the set of 
reagents available to ASKCOS. Reaction path- 
ways that do not terminate at available re- 
agents within five reaction steps were discarded 
from consideration (supplementary materials, 
SM3). The majority of reaction pathways for 
generated molecules require multiple reac- 
tions, often from different reaction classes. 
By executing multistep reaction pathways, more 
of the multidimensional property space be- 
comes accessible (Fig. 2A, volume), and more- 
over, there are more candidate options within 
the space (Fig. 2A, density). Single-step synthe- 
ses cover only 30% of the space accessible with 
five reaction steps and have 1000-fold fewer 
candidate options (Fig. 2A). In our property 
space of interest, molecules requiring more 
than five reaction steps did not further expand 
the volume of property space but did increase 
complexity and resource requirements. As a re- 
sult, we set a limit of five reaction steps for the 
case studies presented, but other spaces may 
require more steps for adequate exploration. 
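
The template-based tree search with a five-step cutoff described above can be illustrated with a minimal sketch. The template table, the reagent names, and the function `plan` are hypothetical stand-ins and are not part of ASKCOS; a real planner applies templates extracted from reaction databases. Counting steps as the depth of the search tree is also an assumption.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical toy "templates": each product maps to alternative precursor sets.
TEMPLATES: Dict[str, List[Tuple[str, ...]]] = {
    "dye": [("intermediate_1", "amine"), ("exotic_precursor",)],
    "intermediate_1": [("aryl_halide", "boronic_acid")],
}
# Hypothetical set of reagents available from vendors or the in-lab inventory.
BUYABLE: Set[str] = {"amine", "aryl_halide", "boronic_acid"}
MAX_STEPS = 5  # pathway-length limit used in the case studies

Route = List[Tuple[str, Tuple[str, ...]]]


def plan(target: str, steps_left: int = MAX_STEPS) -> Optional[Route]:
    """Return reactions (product, precursors) ending at buyable reagents, or None."""
    if target in BUYABLE:
        return []  # nothing to synthesize
    if steps_left == 0:
        return None  # depth budget exhausted before reaching buyable reagents
    for precursors in TEMPLATES.get(target, []):
        route: Route = []
        for precursor in precursors:
            sub_route = plan(precursor, steps_left - 1)
            if sub_route is None:
                break  # this disconnection fails; try the next template
            route.extend(sub_route)
        else:
            return route + [(target, precursors)]
    return None  # no template leads to available reagents: discard the candidate
```

Calling `plan("dye")` returns a two-reaction route in this toy table, whereas `plan("dye", 1)` returns `None` because the intermediate cannot be reached within one step.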

Retrosynthesis planning also serves to deter- 
mine which generated candidates are considered 
synthesizable. Depending on the heterocyclic 
scaffold, 10 to 20% of generated molecules re- 
turn synthesis pathways terminating at buyable 
starting materials. Retrosynthesis planning is 
the most time-intensive front-end prediction 
needed to create an experimental plan. Unre- 
alistic molecules suggested by molecular gen- 
eration are often filtered with synthesizability 
proxy metrics (36-39); however, commonly 
used metrics did not increase the quality of re- 
action planning (fig. S2), and therefore, they 
were not used to prefilter candidate structures 
during exploration. The continued development 
of more efficient methods to recommend can- 
didates and pathways simultaneously would 
increase platform planning throughput (39). 

Reaction conditions are recommended by 
ASKCOS by using a neural network model 
trained on a large set of literature reaction con- 
ditions (40). The recommendation of reaction 
conditions can be challenging as necessary 
reagents, catalysts, cocatalysts, solvents, and 
reaction temperature must be predicted sim- 
ultaneously. To assist ASKCOS, we defined a 
set of rules for the 100 most popular chemical 
transformation templates (supplementary mate- 
rials, SM3) against which ASKCOS predictions 
are checked. These rules identify common prob- 
lems such as not recommending all required 
reagents for well-known reactions (e.g., rec- 
ommendation of phosphorus ligand, base, sol- 
vent, and palladium source for cross-coupling 
reactions) or recommending outdated cata- 
lysts when improved catalysts are known. Re- 
action condition recommendations that did not 
match the predefined rules were automatically 
either corrected when a single required chemical 
was missing or removed from consideration 
when multiple issues were present. Although 
defining rules for known reactions can help 
fix some problematic recommendations, the 
continued development of reaction condition 
models is needed to increase the success rate 
of platform-executed reactions. Pathways con- 
taining a reaction without any valid recom- 
mended conditions are automatically discarded. 
The combination of reaction pathways and reaction 
conditions forms an experimental plan for 
each of the generated molecules (Fig. 2B). 
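
The rule check on recommended reaction conditions (correct a single missing chemical, discard a recommendation with multiple issues) might look like the following sketch. The rule table, the role names, and the default chemicals are illustrative assumptions and do not reproduce the rules in the supplementary materials.

```python
from typing import Dict, Optional

# Hypothetical rule table: required reagent roles for a reaction class, each
# with a default chemical that may be filled in when only that role is missing.
RULES: Dict[str, Dict[str, str]] = {
    "cross_coupling": {
        "ligand": "phosphine ligand",
        "base": "carbonate base",
        "solvent": "dioxane",
        "palladium_source": "Pd precatalyst",
    }
}


def check_conditions(reaction_class: str,
                     recommended: Dict[str, str]) -> Optional[Dict[str, str]]:
    """Return usable conditions, or None if the recommendation is dropped."""
    required = RULES.get(reaction_class)
    if required is None:
        return dict(recommended)  # no rule defined for this class: accept as-is
    missing = [role for role in required if role not in recommended]
    if not missing:
        return dict(recommended)  # all required roles present
    if len(missing) == 1:
        fixed = dict(recommended)
        fixed[missing[0]] = required[missing[0]]  # single gap: auto-correct
        return fixed
    return None  # multiple issues: remove from consideration
```

A pathway would then be discarded when any of its reactions has no recommendation that survives this check.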

Fig. 2. Product and reaction pathway selection. (A) Property space (λmax, logKow, and logkdeg) convex hull volume (red) and density of candidates (blue) for the BTPP scaffold shown in the exploration case study as a function of reaction pathway length. Dashed lines are splines provided as a guide to the eye. The convex hull volume at each pathway length has units of wavelength, logKow, and logkdeg normalized to the volume at a reaction pathway length of 5. The density is the similarly normalized number of candidates per unit of volume. (B) Example of an experimental plan to be scored, including the predicted property values (units of nanometers for λmax and units of log10(s⁻¹) for logkdeg) and reaction pathways, and the top-ranked reaction context. The target candidate molecule had four experimental plans for consideration [red stars in (C)]. (C) Evaluation of experimental plan scores considering the platform feasibility of the proposed reaction pathway and candidate molecule value. The space is separated by infeasible (left) and feasible (right) candidate pathways. Candidates were deemed infeasible if they required platform-inaccessible reaction conditions or reagents. The color scale represents the density of candidates, increasing from low (blue) to high (yellow). (D) Decision cycle for additional reagents to buy by evaluating the products and reaction pathways they enable. Upon terminating, the platform has selected products to make, reactions to execute, and reagents to purchase. Reaction pathways connect buyable reactants (green) to intermediate products (orange) and target final products (blue). 

22 December 2023 

For candidates with an experimental plan, 
their dye-like properties were predicted by using 
a set of Chemprop (14, 15) property prediction 
models: wavelength of maximum absorption, 
partition coefficient, and photooxidative 
degradation rate. Chemprop is lightweight and 
fast, so an ensemble of models can be auto- 
matically retrained from the whole dataset 
with each batch of experimental data (sup- 
plementary materials, SM4) (14-16, 41). The 
ensemble variance is used as a proxy for model 
uncertainty to further inform molecule selec- 
tion. Each of the models was initially trained 
on datasets of different sizes: 26,400 experi- 
mental absorption maxima from a compiled 
literature dataset (42), 23,500 experimental 
partition coefficient values also from a compiled 
literature dataset (43), and 85 experimental 
photooxidative degradation rates measured 
in house for a set of common molecular dyes 
(supplementary materials, SM11). Because of 
the small size of the third dataset, we used a 
random forest model implemented through 
Chemprop to predict photooxidative degra- 
dation rates (supplementary materials, SM4). 
On every iteration of the DMTA cycle, ensem- 
bles of models were trained with experimental 
data measured on realized molecules and the 
initial datasets. Subsequent rounds of prop- 
erty predictions were automatically made with 
the most recent property model version, and 
the predicted properties and uncertainties were 
used in conjunction with experimental plans 
to select the next iteration of candidates for 
experimental realization. 
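
Using ensemble variance as a proxy for model uncertainty can be sketched in a few lines. The function `ensemble_predict` and the `Model` type are hypothetical; in the platform the ensemble members are trained Chemprop models, which this sketch replaces with plain Python callables.

```python
from statistics import mean, pvariance
from typing import Callable, List, Sequence, Tuple

# A stand-in for one trained property model: SMILES string in, prediction out.
Model = Callable[[str], float]


def ensemble_predict(models: Sequence[Model], smiles: str) -> Tuple[float, float]:
    """Return (mean prediction, variance across ensemble members)."""
    predictions: List[float] = [model(smiles) for model in models]
    # The spread between members serves as the uncertainty proxy.
    return mean(predictions), pvariance(predictions)
```

A candidate on which the members disagree strongly has a large variance, which raises its value as a data point during exploration.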

Selecting candidate molecules is challeng- 
ing, as it involves a tradeoff between optimizing 
multiple properties and executing a success- 
ful synthesis. Additionally, both syntheses and 
properties are predictions from statistical mod- 
els with varying uncertainty, which further 
complicates selection. For an input scaffold, 
there can be thousands of candidates, each with 
up to 10 pathways, and thousands of reagents 
to consider across all pathways. Reactions that 
require inaccessible conditions, require re- 
agents without vendor information, or involve 
blacklisted reagents are automatically identi- 
fied and removed from consideration. Two gen- 
eral categories of infeasible reactions are those 
requiring inaccessible reagents (no platform- 
known source or prohibitively expensive) and 
those involving platform-incompatible reac- 
tion conditions (reaction temperatures outside 
the range of 5° to 200°C or the use of reagents 
that cannot be safely handled by the platform). 
Incompatible reagents include reactive gases 
(e.g., hydrogen and carbon monoxide), solid 
reagents that do not form solutions (e.g., alkali 
metals and metal powders), and highly reactive 
reagents that require special handling 
to be used safely (such as organolithiums). The 
capabilities of the platform can be expanded 
by developing or purchasing special reactors, 
nitrogen-purged enclosures, and other engi- 
neering controls to safely handle these reagents. 
Despite 40% of proposed pathways being in- 
feasible (Fig. 2C), the general nature of CASP 
implemented by ASKCOS means that there are 
usually alternative proposed pathways with accessible 
reactions, and only around 15% of 
candidates are removed from consideration. 

The feasible candidate pathways are scored 
by their molecular value (the product of scaled 
prediction values and scaled ensemble var- 
iances) and relative platform feasibility (the 
product of inverse reagent cost, inverse path- 
way length, and compatibility of proposed re- 
action conditions with the well plate format) 
(supplementary materials, SM5). Only the highest-scoring 
quartile of candidates is considered 
further (Fig. 2C). Any reagents required for 
the candidates that are not in the laboratory 
chemical inventory are rank ordered by the 
cumulative scores of the top 10 (exploration) 
or top one (exploitation) candidates that they 
are used to synthesize. The top-ranked reagent 
is added to the platform library and candi- 
dates that can be synthesized with the addi- 
tion of the reagent are added to the selected 
products list. All remaining candidates are 
rescored and reagents reranked based on the 
updated library (Fig. 2D). The cycle of evalua- 
tion and selection is repeated until a predefined 
number of candidates are selected or the re- 
agent budget is exhausted. Afterwards, the 
platform generates a document to prompt the 
operator to prepare stock solutions, specifying 
which vendors to purchase from and the mini- 
mum amount of material needed. The operator 
decides how to make stock solutions; solutions 
are prepared at high concentrations in solvents 
such as dimethylformamide, which would in- 
troduce 1 to 5% of stock solution solvent into 
the reaction, or at lower concentrations in vola- 
tile solvents such as chloroform, which would 
evaporate before the reaction begins. Complet- 
ing this planning document serves as an op- 
tional managerial checkpoint for the operator 
to review the experiments to make sure that 
they will be safe. 
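
The scoring and the reagent decision cycle described above can be sketched as follows. This is a simplified illustration under stated assumptions: multiplying molecular value by feasibility to obtain one score, skipping reagents that exceed the remaining budget, and all names (`plan_score`, `Plan`, `select`) are hypothetical, and the top-quartile filter is omitted. The exact scaling is given in the supplementary materials (SM5) and is not reproduced here.

```python
from dataclasses import dataclass
from math import prod
from typing import Dict, FrozenSet, List, Sequence, Set, Tuple


def plan_score(scaled_preds: Sequence[float], scaled_vars: Sequence[float],
               reagent_cost: float, n_steps: int, plate_compat: float) -> float:
    """Molecular value times platform feasibility (combining them is an assumption)."""
    value = prod(scaled_preds) * prod(scaled_vars)
    feasibility = (1.0 / reagent_cost) * (1.0 / n_steps) * plate_compat
    return value * feasibility


@dataclass(frozen=True)
class Plan:
    name: str
    score: float
    reagents: FrozenSet[str]  # every reagent the pathway needs


def select(plans: List[Plan], inventory: Set[str], prices: Dict[str, float],
           budget: float, max_products: int, top_n: int = 10) -> Tuple[List[str], List[str]]:
    """Greedily buy the reagent that unlocks the most score, then reselect."""
    library, chosen, bought = set(inventory), [], []
    remaining = sorted(plans, key=lambda p: p.score, reverse=True)
    while len(chosen) < max_products:
        # Select every plan whose reagents are all in the current library.
        for p in list(remaining):
            if p.reagents <= library and len(chosen) < max_products:
                chosen.append(p.name)
                remaining.remove(p)
        if len(chosen) >= max_products or not remaining:
            break
        # Rank missing reagents by the summed scores of the top_n plans using them.
        totals: Dict[str, List[float]] = {}
        for p in remaining:
            for r in p.reagents - library:
                totals.setdefault(r, []).append(p.score)
        ranked = sorted(
            totals,
            key=lambda r: sum(sorted(totals[r], reverse=True)[:top_n]),
            reverse=True,
        )
        affordable = [r for r in ranked if prices.get(r, float("inf")) <= budget]
        if not affordable:
            break  # reagent budget exhausted
        library.add(affordable[0])
        bought.append(affordable[0])
        budget -= prices[affordable[0]]
    return chosen, bought
```

Setting `top_n=10` mirrors the exploration setting and `top_n=1` the exploitation setting: the former favors reagents shared by many good candidates, the latter favors the reagent behind the single best one.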


Automation and experimental execution 


Platform-selected reaction pathways are auto- 
matically translated (supplementary materials, 
SM6) into synthesis and characterization work- 
flows to be executed in 96-well plates. Well 
plates offer ease of parallelization and sufficient 
material throughput (10 to 100 µmol scale) 
for multistep synthesis (44-46). The platform 
batches reactions into well plates on the basis 
of reaction temperature, reagent incompat- 
ibilities, and the order of reactions in multistep 
pathways. A finite sequence of temperatures that 
maximizes the number of selected pathways 
that can be completed is identified (Fig. 3A, 
plating), enabling multiple classes of reactions 
to be executed in the same well plate, and in- 
creasing throughput compared with the man- 
ual approach of grouping reactions by class 
alone. When this procedure leaves empty wells 
in a plate, the platform selects additional mol- 
ecules that can be synthesized with the reagents 
in the platform library, increasing the use of 
the synthesis hardware and giving additional 
structures for property prediction model re- 
training with minimal additional experimen- 
tal effort. Along with reaction execution tasks, 
high-level goal-oriented tasks to work up the 
reaction, analyze reaction outcomes, isolate tar- 
get products, and characterize the isolated 
molecules are automatically generated, cre- 
ating a series of platform jobs (Fig. 3A, or- 
ganization). Four independent systems with 
different capabilities work simultaneously to 
accomplish these goal-oriented tasks while a 
master controller orchestrates tasks over a local 
network (Fig. 3A, experimental platform). Cur- 
rently, the four systems are a liquid handler, 
a high-performance liquid chromatography 
(HPLC) with mass spectrometry instrument, 
a robotic arm, and a special processes unit that 
manages a storage carousel, a high-temperature 
reactor, a plate reader, and a photodegrada- 
tion reactor. 
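
A much-simplified sketch of batching reactions into 96-well plates follows. It only groups reactions by pathway step and reaction temperature and splits each group into plates; the platform additionally accounts for reagent incompatibilities and searches for a temperature sequence that maximizes completed pathways, which is not attempted here. The names `Reaction` and `batch` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple, Tuple

WELLS_PER_PLATE = 96


class Reaction(NamedTuple):
    rxn_id: str
    step: int           # position in its multistep pathway (1 = first step)
    temperature_c: int  # planned reaction temperature in degrees Celsius


def batch(reactions: List[Reaction]) -> List[List[Reaction]]:
    """Group reactions by (step, temperature) and split groups into plates."""
    groups: Dict[Tuple[int, int], List[Reaction]] = defaultdict(list)
    for reaction in reactions:
        groups[(reaction.step, reaction.temperature_c)].append(reaction)
    plates: List[List[Reaction]] = []
    for key in sorted(groups):  # earlier steps first, then ascending temperature
        members = groups[key]
        for start in range(0, len(members), WELLS_PER_PLATE):
            plates.append(members[start:start + WELLS_PER_PLATE])
    return plates
```

Because grouping ignores reaction class, a plate can mix different reaction classes that share a temperature, which is the source of the throughput gain over grouping by class alone.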

Platform systems query two databases: a 
design database and a results database (sup- 
plementary materials, SM7). The design data- 
base tracks platform resources and is used by 
the platform systems to make real-time deci- 
sions on how to accomplish their assigned tasks. 
The results database manages experimental data 
and is used by the platform systems to record 
synthesis outcomes and molecular properties. 
The master controller schedules tasks to ho- 
nor time constraints between tasks, adjusting 
to workflow changes made by the systems as 
needed (Fig. 3B). The occurrence of congestion 
on the platform necessitates additional rules 
for trafficking well plates between systems. 
Performance, a measure of platform efficiency 
(figs. S6 and S7; supplementary materials, SM7), 
is the percentage of well-based or plate-based 
operations, as appropriate, which did not gen- 
erate an error requiring human intervention. 
For example, the HPLC analyzes the contents 
of every well, whereas the robotic arm moves 
an entire well plate at once, so their perform- 
ances are evaluated per well and per plate, re- 
spectively. The performance of all systems 
exceeded 95% during the final 30 jobs for the 
exploration case study, which involved >1600 
reactions (Fig. 3C) that were carried out across 
910 hours of operations in a 5-month period. 
An analysis of the platform’s performance is 
presented in the supplementary materials 
(SM7). It is important to note that although 
other high-throughput experimentation plat- 
forms can achieve higher throughputs (>1000 
single-step reactions per day) for reaction con- 
dition screening, they do so for a narrow set of 
single-step reactions (e.g., C-C or C-N coupling) 
(46-48). The platform throughput is limited 
to ~200 reactions per day by HPLC analysis of 
reaction outcomes, but because of the wide 
scope of the workflows (prediction, execu- 
tion, isolation, characterization, and model 
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Fig. 3. Flow of information A 
during experiment design, 
orchestration, execution, 
and data generation. 

(A) Platform-selected reac- 
tions from front-end pre- 
dictions (Fig. 2) are batched 
into well plates, and the 
sequence of well plates with 
the highest value set of 
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30 jobs of the exploration case study. 


retraining), our realized throughput is lower 
and depends on the platform-selected chemistry. 
Other control architectures, such as those im- 
plemented in Chemputer (49, 50) and ChemOS 
(51), were considered, but none supported par- 
allel operation in well plates, work up and iso- 
lation, characterization, and the ability to modify 
workflows spontaneously. The described plat- 
form can perform multiple unrelated jobs in 
parallel and only requires human intervention 
for nonautomated error recovery and restocking
(see “The role of the platform operator”; further
detailed in supplementary materials, SM8).
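
The performance metric described in this paragraph can be illustrated with a short calculation. This is a minimal sketch, not the platform's code: the record layout and the sample counts are hypothetical and chosen only to show that each system is scored in its own unit (per well or per plate).

```python
from collections import defaultdict

def success_rates(records):
    """Percent of operations per system that needed no human intervention.

    Each record is (system, n_operations, n_errors). Operations are counted
    per well or per plate, whichever unit suits the system.
    """
    ops = defaultdict(int)
    errs = defaultdict(int)
    for system, n_ops, n_errs in records:
        ops[system] += n_ops
        errs[system] += n_errs
    return {s: 100.0 * (ops[s] - errs[s]) / ops[s] for s in ops if ops[s] > 0}

# Hypothetical counts: the HPLC is scored per well, the robotic arm per plate.
records = [
    ("hplc", 96, 2),
    ("hplc", 96, 1),
    ("robot_arm", 12, 0),
]
rates = success_rates(records)
```

Scoring each system in its natural unit keeps a single plate-level fault of the robotic arm from being compared directly with a single well-level fault of the HPLC.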

To begin executing a job, the master con- 
troller checks that the first required system is 
available and that subsequent systems will 
have timely availability, ensuring that ongoing 
operations are not interrupted (Fig. 3B). Syn- 
theses begin with reaction preparation in 96-well 
plates: The liquid handling robot finds the re- 
quired reagents, dispenses them at appropri- 
ate volumes depending on stoichiometry and 
stock solution, and dispatches the reaction 
well plate to a reactor capable of the required 
reaction conditions [e.g., palladium-catalyzed 
reactions are sent to a nitrogen purged reactor; 
custom reactors are described in the supple- 
mentary materials (SM11)]. If the needed re- 
agent is a product of a previous reaction, the 
volume dispensed is adjusted on the basis of 
the estimated reaction yield (supplementary 
materials, SM9). Reactions are run for 8 hours,
which is excessive for faster reactions and
inadequate for the most sluggish ones. Selecting
an optimal duration for well plates containing 
many different reactions is not currently pos- 
sible; quantitatively predicting kinetics for any 
proposed substrate, catalyst, and reagent com- 
bination is very challenging. How to predict 
more nuanced reaction conditions and dura- 
tions for diverse reaction classes is an unsolved 
challenge that requires scalable predictions of 
chemical kinetics for complex molecules. 
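
The yield-based volume adjustment mentioned above can be sketched as follows. The actual procedure is given in the supplementary materials (SM9) and is not reproduced here; the formula, function name, and parameters below are assumptions for illustration, treating the crude intermediate as a stock solution whose nominal concentration is scaled by the estimated yield.

```python
def dispense_volume_ul(limiting_umol, equivalents, stock_molar, est_yield=1.0):
    """Volume (uL) of stock solution that delivers `equivalents` of a reagent.

    stock_molar is the nominal concentration in mol/L. For a reagent that is
    the crude product of an earlier reaction, est_yield (0 < y <= 1) scales the
    nominal concentration down to the estimated actual concentration.
    """
    if not 0.0 < est_yield <= 1.0:
        raise ValueError("est_yield must be in (0, 1]")
    effective_molar = stock_molar * est_yield
    # umol divided by mol/L gives uL
    return limiting_umol * equivalents / effective_molar

# Illustrative numbers only: 10 umol scale, 1.2 equivalents, 0.5 M stock.
v_commercial = dispense_volume_ul(10.0, 1.2, 0.5)
v_intermediate = dispense_volume_ul(10.0, 1.2, 0.5, est_yield=0.6)
```

With these made-up inputs, a commercial stock needs 24 uL, whereas an intermediate formed in an estimated 60% yield needs 40 uL to deliver the same amount.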

After executing reaction tasks, a series of 
automatically selected and executed work-up 
tasks process the crude products in prepara- 
tion for ensuing reactions and HPLC analysis. 
Reactions with inorganic reagents such as car- 
bonate or palladium complexes are extracted 
with chloroform and water in deep well plates, 
and all reactions are filtered through filter well 
plates. For intermediate products, a small sam- 
ple ( to 10%) of the filtered reaction is trans- 
ferred to a new plate and diluted with dimethyl 
sulfoxide to dissolve the target products in a 
reverse-phase HPLC-compatible solvent. The 
reaction samples are transferred to the HPLC 
autosampler by the robotic arm and analyzed 
with a standardized water and acetonitrile gra- 
dient (supplementary materials, SM9). Reac- 
tion success was determined by identifying 
expected reaction product masses, and reaction
yield was estimated from the HPLC chromatogram
measured by the photodiode array
detector. Operation and data export were automated
through a vendor-supplied application-
programming interface, and analysis occurred 
in real time (supplementary materials, SM9). 
For final products, the reaction outcome is an- 
alyzed in the same manner, but the majority 
of the filtered reaction is added to the HPLC 
analysis well plate. The retention time is used 
in a subsequent semipreparative HPLC run to 
isolate final products. The two-run method 
proved robust, but retention time prediction 
and real-time control of eluent composition 
could enable more efficient isolation without a 
prerequisite analytical run. Although not every 
product peak was baseline separated from 
by-products, our property assays capitalized 
on dye-molecule absorption of visible light, 
minimizing the influence of any ultraviolet- 
absorbing impurities on the property mea- 
surements. Although isolation of intermediate 
products is desirable, limitations of the cur- 
rent platform instruments make this imprac- 
tical. Therefore, we used filtered intermediate 
reactions as feedstock for subsequent reac- 
tions, with around one-third of the multistep 
reaction pathways yielding the targeted final 
products. This success rate could be bolstered 
by preparative-scale HPLC purification but is 
not required to demonstrate autonomous mo- 
lecular discovery. Given the limitations of 
small reaction scales and multistep reaction 
pathways, the operator must scale up reactions
and take NMR spectra off of the platform. Structural
validation is a challenge for discovery-
scale platforms but can be accomplished with 
NMR after interesting chemical spaces have 
been discovered. 

For each property of interest, measurement 
and data processing protocols are defined be- 
forehand (supplementary materials, SM9 and 
SM10) and are automatically applied to iso- 
lated products. Platform systems add mea- 
sured data to the results database in real time 
for use when retraining the property predic- 
tion models. Absorption spectra are measured 
with a plate reader, partition coefficients are 
extracted from calibrated HPLC retention 
times, and photooxidative degradation rates 
are measured in a custom-built solar degra- 
dation device (supplementary materials, SM12). 
The device simulated sunlight at 10% solar 
intensity while periodically collecting absorp- 
tion spectra in a plate reader; by quantifying 
the change in intensity of the pure sample spec- 
trum deconvoluted from the measured spec- 
trum, the rate of degradation could be extracted 
(supplementary materials, SM11). Additional 
properties can be measured with new hard- 
ware or workflows such as biological assays, 
and parallel implementation allows new hard- 
ware to work alongside existing hardware. 
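
As an illustration of how a degradation rate can be extracted from a series of absorption measurements, the sketch below fits a first-order decay, A(t) = A0 exp(-kt), by least squares on ln(A) versus t. It assumes that the spectral deconvolution has already produced one intensity per time point; the function name and the synthetic data are hypothetical, and the platform's own processing is described in the supplementary materials.

```python
import math

def first_order_rate(times_s, intensities):
    """Least-squares slope of ln(A) versus t; returns k in s^-1."""
    ys = [math.log(a) for a in intensities]
    n = len(times_s)
    t_mean = sum(times_s) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in times_s)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_s, ys))
    return -sxy / sxx

# Synthetic decay with k = 1e-3 s^-1 (illustrative values only).
times = [0.0, 600.0, 1200.0, 1800.0]
signal = [math.exp(-1e-3 * t) for t in times]
k = first_order_rate(times, signal)
log_k = math.log10(k)  # rates are reported on a log10 scale
```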


The role of the platform operator 


Efforts toward realizing autonomous chemical 
discovery platforms have highlighted minimiz- 
ing human intervention; however, human op- 
erators continue to have essential roles in the 
operation of these platforms. In this study, 
humans first need to set up platform goals 
and initialization operations (supplementary 
materials, SM8). During platform operation, 
operators oversee proposed experiments, help 
recover errors, and manage resources. One com- 
mon source of platform errors requiring hu- 
man intervention is hardware collision, which 
requires the operator to investigate the causes 
of the collision before reenabling the affected 
subsystems. These collisions arise from tight 
tolerances and robotic arm training drift that 
could be reduced by incorporating autocali- 
bration equipment to preventively retrain po- 
sitions. A particularly temperamental platform 
component is the HPLC, which requires an 
operator to bring it back online after errors 
such as clogged plumbing. Hardware or soft- 
ware solutions to improve robustness against 
common errors could be developed, such as 
analyzing which reactions lead to HPLC clogs 
and changing how those reactions are conducted.
After setup, the platform proposes a
series of experiments with operational plans. 
The operator may choose to review these pro- 
posed experiments to determine if they are 
safe to execute and whether they address the 
discovery goal well enough to invest platform 
time and resources; if not, the operator refines
the goal or provides more tools to the platform.
Afterwards, while the platform executes
experiments, the operator is involved in plat- 
form resource management (keeping the plat- 
form stocked and purchasing and preparing 
stock material; supplementary materials, SM8). 
These managerial and custodial tasks are key 
factors that enable autonomous discovery and 
are often omitted when describing the opera- 
tion of automated chemical platforms. 

Despite progress, it is unlikely that human 
intervention will be completely removed from 
autonomous chemical discovery platforms in 
the near term, and it will continue to be a key
point in ongoing conversations as autonomous 
platforms continue to advance. However, to 
assess autonomy in chemical discovery plat- 
forms (52, 53), five items must ultimately be 
considered: (i) the scope and context of the
discovery goal, (ii) the degree of choice afforded
to the platform to accomplish the goal,
(iii) the scale of real-time process-level feedback
and adaptation, (iv) the manner in which
experimental data informs future experiment 
design, and (v) the extent of platform automa- 
tion. A detailed breakdown of these items is 
presented in the supplementary materials (SM8). 
As an illustrative comparison, specialized re- 
action optimization platforms are tasked to 
describe a reaction condition-yield landscape, 
but the scope of that discovery goal is narrow, 
and the experiment choices that can be made 
are highly constrained (adjustments to concen- 
trations, temperatures, etc.) (54). These dis- 
covery tasks can be accomplished with fixed 
hardware configurations and minimal workflow
flexibility and can be guided by routine optimization
algorithms. Although these platforms
are self-driving, their degree of autonomy is
restricted to small, well-defined scopes and 
discovery goals. The platform we developed 
aims to explore general structure-property 
chemical spaces by leveraging the power of 
general machine learning predictive tools, only 
limited by safety considerations. As more chem- 
istry is conducted by general-purpose autono- 
mous discovery platforms, workflows and 
predictive models can be continually refined, 
further shifting the role of the operator toward 
supervision. 


Exploration case study 


Molecular exploration of chemical spaces be- 
yond current datasets aims to identify produc- 
tive spaces and determine whether molecules 
within those spaces have a set of desired prop- 
erties. Property prediction models in these un- 
explored chemical spaces are uncertain because 
relevant training examples are not present to 
anchor predictions. To improve the property 
prediction models in these spaces, the platform 
first proposes a large number of candidates, 
then experimentally realizes several low-cost 
examples, and finally uses those examples to
anchor future predictions. This approach allows
the models to understand the chemical
space with accessible chemistry and avoid in- 
vesting excessive time and resources into syn- 
thetically complex, predicted high-performing 
molecules. With more accurate, anchored mod- 
els, a further evaluation can narrow down the 
potential chemical space to a small number 
of hits for further exploration. This approach 
to molecular discovery is typical for identify- 
ing promising hit chemical spaces in hit-to- 
lead workflows. 

To demonstrate molecular exploration, we 
prompted the platform to explore the proper- 
ties of unexplored heterocyclic dye-like mole- 
cules. To find candidate scaffolds, we manually 
filtered ring structures with fewer than eight 
hits in the ZINC database (supplementary ma- 
terials, SM1), as many molecular dye families 
are based around conserved, conjugated het- 
erocyclic scaffolds (like xanthenes and cou- 
marins). Cyclization reactions are critical to 
form heterocyclic structures but are challeng- 
ing for retrosynthesis planners because mul- 
tiple bonds form simultaneously (55). We 
manually inspected ASKCOS-recommended 
cyclization reactions for the ZINC hits to ver- 
ify the ring-forming reaction templates and 
selected four heterocyclic structures to serve 
as exploration scaffolds (Fig. 4A). The graph- 
completion model automatically generated 
candidates by allowing any combination of 
C-H bonds on each scaffold to be functional- 
ized (fig. S1). To explore the property space of 
these heterocyclic scaffolds, we prompted the 
platform to perform three rounds of molecu- 
lar exploration. Beforehand, we provided a 
set of preferred-vendor chemical catalogs to 
the platform that, with the in-lab inventory, 
determined the domain of possible molecules. 
In each round, the platform considered prop- 
erty predictions, the practical constraints of 
reaction platform-feasibility, and reagent avail- 
ability to select candidates to realize and char- 
acterize. Before starting subsequent rounds, 
the characterization data were used to up- 
date the property prediction models (discussion 
of the models follows in Fig. 5). The platform 
then used the retrained property models to 
drive a new iteration of the DMTA cycle. 
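
The round structure described above (select candidates, realize and characterize them, retrain, repeat) can be sketched as a generic loop. This is only a schematic under stated assumptions: `select`, `realize`, and `retrain` are hypothetical stand-ins for the platform's candidate selection, automated synthesis and characterization, and model retraining, and the toy data are invented.

```python
def run_exploration(candidates, select, realize, retrain, model, rounds=3):
    """Iterate select -> realize/measure -> retrain for a fixed number of rounds."""
    measured = {}
    attempted = set()
    for _ in range(rounds):
        # Candidates that already failed or succeeded are not proposed again.
        pool = [c for c in candidates if c not in attempted]
        batch = select(pool, model)
        attempted.update(batch)
        measured.update(realize(batch))  # failed syntheses return no entry
        model = retrain(model, measured)
    return model, measured

# Toy stand-ins: None marks a synthesis that fails.
truth = {"a": 1.0, "b": 2.0, "c": None, "d": 4.0}
select = lambda pool, model: pool[:2]
realize = lambda batch: {c: truth[c] for c in batch if truth[c] is not None}
retrain = lambda model, data: dict(data)

model, measured = run_exploration(list(truth), select, realize, retrain, model={})
```

In this toy run, candidate "c" fails and is simply skipped in later rounds, while the three successful measurements become the data the next "model" is built from.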

Across the three rounds of exploration, the 
platform realized between 40 and 130 deriv- 
atives of each scaffold (Fig. 4A); the variation 
reflected differences in reaction fidelity. Rec- 
ommended reaction pathways vary in length 
and retrosynthetic strategy, both between and 
within scaffolds. In some property space re- 
gions, the diversity and complexity required 
can be accessed by coupling commercial re- 
agents together in single-step reactions. In other 
cases, starting materials with appropriate func- 
tional handles are available such that after 
forming the conserved heterocyclic scaffold, 
familiar reactions like palladium-catalyzed
cross couplings can be used to build the
needed diversity and complexity.

Fig. 4. Chemistry predicted and attempted for the
exploration case study. (A) Structures of the four
scaffolds of interest explored by the platform:
pyridoimidazoquinoline (PIQ), IPP, BTPP, and TBT.
Shading color corresponds to the scaffold throughout
the figure. (B) Diagram showing different products
(solid bars, color coded by scaffold) synthesized by
a variety of reaction types (hatched bars, color coded
by reaction classification) and reaction pathway
lengths. The sizes of the bars are proportional to the
number of successful reactions and products.
(C) Examples of successful reaction pathways for a
derivative through a single-step pathway for the PIQ
scaffold (dotted pathways), a two-step pathway for the
IPP scaffold [...]

For more
complex cases, reaction pathways begin by 
synthesizing reagents with appropriate func- 
tional handles before executing the critical 
cyclization reaction (Fig. 4B). When looking 
at the executed reaction pathways for each 
scaffold, the successful routes go through sim- 
ilar diversification strategies containing a key 
cyclization reaction (Fig. 4C). For each scaf- 
fold, a variety of reaction classes [based on the 
RXNO ontology (56)] were attempted to realize
the automatically selected derivatives
(Fig. 4D). The reaction success rate depends
on the scaffold and reaction class. For exam- 
ple, a group of pathways that use a Friedel- 
Crafts acylation step were attempted for the 
indenopyridopyrazinone (IPP) derivatives (Fig. 
4D), but the deactivated nature of the IPP scaf- 
fold prevented these reactions from succeed- 
ing. Such failure of chemistry did not halt the 
platform; rather, the reaction failure was 
recorded in the results database and depen- 
dent reaction pathways and characterizations 
were automatically disregarded in future operations.
Varied reaction outcomes are often
the result of insufficient specificity of reaction 
templates used in retrosynthesis planning or 
poor condition recommendations for uncom- 
mon reaction templates. Although outside of 
the scope of work presented, negative reaction 
outcomes could be used to improve retrosyn- 
thesis planning with each iteration. Similarly, 
successful cyclization reactions could be used 
to enhance model confidence when recommend- 
ing those important reactions. Altogether, the 
platform selected 720 molecules to realize 
with an average pathway length of 2.6 reactions, 
requiring the platform to process 37 reaction
well plates and 181 additional well plates for
workup and characterization (Fig. 4D), result-
ing in 294 previously unreported molecules
being realized.

Fig. 5. Changes in property prediction models during iterative exploration. (A) Changes in the
predicted property space with the general property model (Round 0) and following each round of exploration
(Rounds 1 to 3). The shaded colors are based on scaffold coloring from Fig. 4 and apply to all subfigures.
(B) Changes in the models’ MAEs for each version of the λmax, logKow, and logkdeg models evaluated by
using cross-validation. Dashed black lines correspond to MAE values for all scaffolds combined. The
experimental uncertainty (Uexp) is shown in each plot as a magenta dashed line. Exp., experimental; pred.,
predicted. (C) Comparison of predictions and experimental values for each of the property prediction
models as initially trained and following three rounds of exploration. Ntrain, number of initial training
molecules.

Property predictions for candidates made
with initial property models (Fig. 5A, round 0)
spanned a range of values and were relatively
uncertain for the heterocyclic scaffolds of in-
terest. These initial models were trained on
datasets that were either extracted from lit-
erature or measured in house and contain no
examples of the four scaffolds of interest. For
exploration, the candidates that are most val-
uable are affordable, easy to synthesize, have
properties that approach our target property
space (redder absorbing, smaller partition co-
efficient, and photooxidatively stable), and
have sizable prediction model ensemble vari-
ance (fig. S3 and supplementary materials,
SM5). This approach favors diverse structures
that move the derivatives’ predicted properties 
toward the desired property space while pro- 
viding information that will reduce the en- 
semble variance. As seen in Fig. 5, B and C, the 
absorption maximum, partition coefficient, 
and photooxidative degradation rates for the 
automatically realized and characterized mol- 
ecules were substantially different from initial 
model predictions. 
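
The selection criteria listed above (low cost, easy synthesis, properties near the target space, sizable ensemble variance) can be combined into a single candidate value. The sketch below is one simple way to do so and is not the platform's valuation function, which is described in the supplementary materials (SM5); the linear form, the weights, and the 0-to-1 normalization of every term are assumptions.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    name: str
    property_score: float  # 0-1, closeness of predicted properties to the target space
    ensemble_var: float    # 0-1, normalized prediction-ensemble variance
    cost: float            # 0-1, normalized reagent cost
    sa_score: float        # 0-1, normalized synthetic difficulty

def exploration_value(c, w_prop=1.0, w_var=1.0, w_cost=1.0, w_sa=1.0):
    """Reward promising, uncertain candidates; penalize costly, hard-to-make ones."""
    return (w_prop * c.property_score + w_var * c.ensemble_var
            - w_cost * c.cost - w_sa * c.sa_score)

# Hypothetical candidates: A is a confident top performer that is costly to make,
# B is a moderate performer that is cheap, accessible, and uncertain.
a = Candidate("A", property_score=0.9, ensemble_var=0.1, cost=0.8, sa_score=0.7)
b = Candidate("B", property_score=0.6, ensemble_var=0.5, cost=0.2, sa_score=0.2)
ranked = sorted([a, b], key=exploration_value, reverse=True)
```

Under equal weights, the cheap and uncertain candidate B outranks A, which matches the stated preference for low-cost, informative examples during exploration.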

By iteratively and automatically realizing 
and measuring diverse derivatives, the plat- 
form learns about the structure-property space 
for the scaffolds of interest, improving the prop- 
erty prediction quality. Initial property predic- 
tion agreement with experimental values was 
subpar, with mean absolute errors (MAEs) of 
52 nm for wavelength of maximum absorption, 
1.3 for partition coefficient, and 0.30 log₁₀(s⁻¹)
for the rate of photooxidative degradation (Fig. 
5B, round 0). The first round of exploration provided
between 10 and 45 examples of each
scaffold, depending on the complexity and 
success rate of the pathways for that scaf- 
fold. This handful of examples anchored the 
property models, shifting the predicted prop- 
erty space (Fig. 5A) and improving the models’ 
MAEs (Fig. 5B), but the impact was not equal 
across properties or scaffolds. To investigate the 
impact of adding additional data, the platform 
performed two additional rounds of chemical 
exploration using the same candidate selection 
criteria but with updated property models and 
hence, candidate values, after each round. Dur- 
ing three rounds of exploration—synthesizing 
110, 110, and 90 molecules in the first, second, 
and third rounds, respectively—the property 
models improved for all scaffolds (Fig. 5, B and 
C), but the degree of improvement varied, and 
the impact of new data diminished for all 
property models with each round of explora- 
tion (Fig. 5B). For each property of interest, the 
number of experimental data points varied de- 
pending on the signal requirements for the 
assay. The partition coefficients were measured 
by using a calibrated retention time and were 
recorded for all identified products, resulting 
in 294 data points. Sufficient material had to 
be isolated to measure the absorption spec- 
trum, and additional material was needed to 
measure the photooxidative degradation rate, 
resulting in 245 spectra and 211 first-order 
kinetic rate constants. Following the addi- 
tion of the successful experimental data, the 
MAEs decreased to 11 nm, 0.7, and 0.21 log₁₀(s⁻¹).
For each of the scaffolds, the predicted prop- 
erty spaces (Fig. 5A) shifted toward ground- 
truth values as the property models learned 
about the structure-property spaces (Fig. 5, B 
and C), and for partition coefficients, the final 
MAE approached the limit imposed by exper- 
imental uncertainty. 
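
For reference, the mean absolute error used throughout this comparison is the average of |predicted - measured| over all characterized molecules. A minimal sketch with invented absorption maxima (in nm):

```python
def mean_absolute_error(predicted, measured):
    """Average absolute difference between paired predictions and measurements."""
    if len(predicted) != len(measured) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - m) for p, m in zip(predicted, measured)) / len(predicted)

# Illustrative values only, not data from the study.
pred_nm = [500.0, 550.0, 620.0]
meas_nm = [510.0, 540.0, 600.0]
mae_nm = mean_absolute_error(pred_nm, meas_nm)
```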

Although the models’ predictions improved 
for each scaffold, there was little change in 
predictive ability across general chemical spaces, 
demonstrating the local nature of these structure- 
property spaces. By verifying a handful of local 
experimental examples to anchor model pre- 
dictions, candidate scaffolds that perform well 
can be more confidently considered for further 
examination. Initial property models both overpredicted
[such as IPP absorption maxima or
triazolobenzothiadiazinone (TBT) photooxidative
degradation rate (Fig. 5)] and underpredicted
[such as TBT or benzothienopyridopyrimidinone
(BTPP) absorption maxima (Fig. 5)] the real
performance of the candidate scaffolds de- 
pending on the local structure-property space. 
Despite a sizable number of datapoints from 
literature, the initial property models still did 
not extrapolate well to unexplored scaffolds. 
This exemplifies the challenge of relying only 
on generative and property-predicting models 
to guide molecular discovery, an ongoing challenge
in untested chemical spaces.
Our platform overcomes this challenge by automatically
designing and executing experiments
with minimal human assistance, verifying in- 
teresting chemical spaces by first focusing on 
improving property-predicting models with 
relevant local examples before attempting 
complex and expensive molecular targets. For 
exploring these unknown chemical spaces, the 
ensemble variance appears to be a simple and 
effective metric to select regions about which 
the models are less confident for active learn- 
ing campaigns. Looking more closely at each 
property, the absorption maximum and photo- 
oxidative stability predictions have larger 
ensemble variances than their corresponding 
experimental measurement variances, where- 
as the ensemble variances from the partition 
coefficient model prediction nearly match the 
experimental variances. As a result, as the 
accuracy of the partition coefficient model 
improves, its precision decreases (measured 
as ensemble variance), whereas for the other 
two models, accuracy and precision improve to- 
gether. Although ensemble variance is a straight- 
forward value to calculate and understand, as 
a new chemical space is explored, ensemble 
variance becomes a less efficient guide for 
properties with appreciable experimental error. 
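
The ensemble variance discussed here is the spread of one candidate's predictions across the members of a model ensemble. The sketch below computes it and compares it with an experimental variance. The comparison helper reflects our reading of the paragraph above (variance stops being a useful guide once it falls to the level of measurement noise) and is an assumption, as are all names and numbers.

```python
from statistics import mean, pvariance

def ensemble_summary(member_predictions):
    """Mean and population variance of one candidate's predictions across ensemble members."""
    return mean(member_predictions), pvariance(member_predictions)

def variance_is_informative(ensemble_var, experimental_var):
    """Assumed rule of thumb: model disagreement is a useful guide only above measurement noise."""
    return ensemble_var > experimental_var

# Hypothetical partition coefficient predictions from three ensemble members.
m, v = ensemble_summary([2.0, 2.2, 1.8])
guide_low_noise = variance_is_informative(v, experimental_var=0.01)
guide_high_noise = variance_is_informative(v, experimental_var=0.05)
```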


Exploitation case study 


Property-predicting models can be exploited 
to realize potential top-performing molecules 
in chemical spaces containing preexisting data. 
Model exploitation naturally follows molecular 
exploration and is analogous to hit identification
moving to lead optimization. Exploitation
can be achieved on the same integrated
platform as exploration by reweighting how 
candidates are valued. Predicted top perform- 
ers are typically more synthetically complex 
and expensive, so rather than realizing several 
low-cost candidates, only a few confidently 
predicted top performers are selected. 

To demonstrate model exploitation, we chose 
to optimize benzophenothiazinone (BPT) (Fig. 
6A), a promising scaffold with sufficient cy- 
clization yield and good model agreements in 
test experiments (fig. S15), a modest number 
of existing data points providing opportu- 
nity to discover new molecules, and structural 
similarities to other common dye families. We 
updated the property models with three exam- 
ples of the scaffold to represent the promising 
hit and prompted the platform to discover lead 
molecules that simultaneously maximized ab- 
sorption wavelengths, minimized partition 
coefficients, and minimized photodegradation 
rates (Fig. 6B). Although it is tempting to con- 
sider predicted property values alone when 
evaluating candidates (Fig. 6C), molecular 
generation is known to propose unusual struc- 
tures (39) that may be outside the scope of 
property prediction models and have unreal- 
istic predicted values. By considering model 
ensemble variance, synthetic accessibility score 
(36), and total reagent cost during candidate 
selection, the influence of out-of-scope struc- 
tures can be minimized (Fig. 6C), thus valuing 
realistic candidates more highly and shifting 
values away from the optima idealistically defined
by properties alone (Fig. 6B and fig. S4).
The relative importance of these additional 
considerations can be weighted depending 
on acceptable tolerances to unusual chemi- 
cal structures and prediction uncertainties in 
the candidate space. 

The reaction pathways for the generated 
BPT derivatives all contain an established cy- 
clization reaction involving the condensation 
of a halogenated naphthoquinone and a sub- 
stituted 2-aminobenzenethiol (Fig. 6A). Several 
C-H sites on BPT were not accessible within a 
feasible number of reactions or from purchas- 
able chemicals (Fig. 6A). From the generated 
derivatives, the platform automatically selected 
14 of the highest-valued derivatives (Fig. 6B) to 
be realized, a selection of which are shown in 
Fig. 6C (remaining structures in SM13). Because 
of the air sensitivity of 2-aminobenzenethiol 
derivatives, the platform failed to realize some 
candidates that were attempted (fig. S13). For 
the nine candidates that were successfully re- 
alized and characterized, the predicted absorp- 
tion maxima (MAE, 11 nm) and photooxidative 
degradation rate (0.4 log units) showed good 
agreement with experimental values, whereas 
the partition coefficient was consistently under- 
predicted by 2 log units. The performance of 
the BPT scaffold is appealing and could war- 
rant continued investigation: It is structurally 
similar to phenoxazine dyes such as Nile blue, 
which was investigated for targeted delivery 
of photogenerated singlet oxygen (57) but is 
>10-fold more photooxidatively stable. More- 
over, the molecules are fluorescent (fig. S9), 
suggesting the family might have additional 
photochemical activity to explore. 

On inspection, the modest agreement of the 
partition coefficient values is not unexpected, 
as the derivatives that were selected by the plat- 
form all contain a primary aromatic amine. 
Although amine-containing derivatives can 
offer improved properties over amine-free 
derivatives, the prediction of their partition 
coefficients is challenging because their pKa
values (where Ka is the acid dissociation con-
stant) can span a wide range, with charged
species partitioning more into the aqueous 
phase. The partition coefficient model was not 
trained on enough aromatic amines to learn 
that the pKa of 1-amino-BPT is too low for the
derivatives to be ionized at the measured con- 
ditions. In practice, the partition coefficient of 
bioactive molecules is improved by introduc- 
ing sulfonate groups that have sufficiently low 
pKa values to be fully ionized in aqueous media.
However, for our chemical space and prop-
erties of interest, amines strongly improve 
absorption maximum and photooxidative stability,
whereas sulfonates only improve the partition
coefficient. Because our properties of
interest were equally weighted, the
platform selected amine derivatives, improving
two properties and modestly improving
the third. To maximize the improvement of the
least improved property, a different selection
function is needed, such as nondominated
sorting or dynamic property weighting (58).
Additionally, as the number of properties to 
consider grows, introducing property weight- 
ings will be advantageous to prioritize more 
critical properties, making new selection func- 
tions more desirable. 
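
To make the nondominated-sorting alternative concrete, the sketch below extracts the first nondominated (Pareto) front from a set of candidates. It is an illustration only and was not part of the platform; a full nondominated sort would go on to rank the successive fronts. All objectives are written as quantities to minimize, so the absorption wavelength is negated, and the candidate values are invented.

```python
def dominates(a, b):
    """True if a is no worse than b in every objective and strictly better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_front(points):
    """Points that no other point dominates (all objectives minimized)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

# Hypothetical candidates as (-absorption maximum in nm, log Kow, log10 degradation rate).
cand_a = (-650.0, 3.0, -3.0)
cand_b = (-600.0, 1.0, -3.5)
cand_c = (-590.0, 3.5, -2.5)  # worse than cand_a in every objective
front = pareto_front([cand_a, cand_b, cand_c])
```

Unlike an equally weighted sum, this selection keeps cand_b, which trades some absorption wavelength for a much lower partition coefficient, instead of letting two strong properties mask a weak third.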


Conclusions 


Discovery of molecules with targeted proper- 
ties has historically been driven by manual 
experimentation, chemists’ intuition, and an 
understanding of mechanisms and first prin- 
ciples. We have co-opted recently developed 
tools that assist chemists with the DMTA cy- 
cle to automatically explore chemical space 
and exploit known chemical structures. In the 
classic approach to the molecular discovery 
cycle, a hit molecule is altered to understand 
structure-function relationships; the platform 
we demonstrate does this without manual ex- 
perimentation. The platform proposed, syn- 
thesized, and characterized 303 unreported 
dye-like molecules spread across four explo- 
ration scaffolds and one exploitation scaffold. 
The same integrated platform was able to both 
explore unknown structure-property spaces by 
searching for hit structures and exploit known 
structure-property spaces by optimizing lead 
structures. By the final rounds of operation, a 
typical job might encounter a single fault re- 
quiring operator intervention while prepar- 
ing, executing, working up, and analyzing a 
well plate of up to 96 reactions (over the course 
of >1600 reactions). Although we present a 
general autonomous chemical discovery plat- 
form, there are areas in which autonomy can 
be further developed. There are cases in which 
human intervention can be further reduced by 
implementing existing, though often costly
and less accessible, equipment across the plat-
form, such as commercial devices to dispense 
powders and prepare stock solutions. Addi- 
tionally, the robust prediction of product dis- 
tributions, by-products, and the corresponding 
solubilities in different environments would 
enable improved predictive purification and 
preemptive identification of potential exper- 
imental issues; however, these models do not 
exist. Reinforcement learning could also be 
used to develop a variety of workflows (59) 
that could help develop new operations or re- 
fine existing workflows, introducing a new 
dimension of platform adaptability for future 
versions of autonomous chemical discovery 
platforms. 

Dye-like molecules represent a compelling 
property space and chemical space to explore 
and exploit; multistep syntheses applying a 
variety of chemical transformations are needed 
to access structures with a range of properties. 
The modular architecture of the integrated
platform we demonstrated allows additional
modules and workflows to be added as needed 
for different properties, such as biological ac- 
tivity, without impacting existing platform 
capabilities. Future iterations of the platform 
will benefit from improvements in predictive 
capabilities, particularly reaction fidelity, con- 
dition recommendation, and molecular gen- 
eration, as well as analytical tools such as 
structural elucidation by means of HPLC or 
standalone detectors. The ongoing develop- 
ment of closed-loop integrated platforms is a 
promising path to continue accelerating mo- 
lecular discovery. 


Materials and methods 


The following discussion summarizes the user 
workflow for an example scaffold (BTPP deri- 
vative from Fig. 2B). A more detailed description 
of autonomous candidate generation, candidate 
property evaluation and synthesis planning, 
model retraining, and all platform operations, 
including chemical handling, reaction prepa- 
ration, execution, workup, and analysis, and 
product property analysis can be found in the 
provided supplementary materials. 


Workflow initiation and platform operation 
(by a human operator) 


The DMTA cycle must be instantiated by an 
operator who provides three configuration files 
to enable the workflow to run autonomously: 
(i) a set of Chemprop model parameters de- 
tailing each model’s training, initial/literature 
data, and a specific goal for each property (max- 
imize, minimize, or target a specific value), (ii) 
a scaffold from which to generate candidate 
molecules—in this work scaffolds were checked 
to ensure each had few literature precedents 
for discovery in unmapped chemical spaces— 
and (iii) a catalog of materials that can be 
handled by the automated synthesis machin- 
ery with associated costs and total budget. 
After starting the DMTA cycle, the operator 
must support the autonomous workflow by 
providing materials requested by the plat- 
form. Most requests entail providing chem- 
icals not already in the platform library or not 
present in sufficient quantity; typically, such a 
request is fulfilled by refilling a solvent vessel 
or providing a stock solution in a septum- 
capped vial, either a 2-mL HPLC vial or 20-mL 
scintillation vial. Occasionally the platform 
may request restocking of consumable labware, 
such as well plates, lids, pipette tips, etc., but 
these are stored on-platform in sufficient quan- 
tity for multiple rounds of the DMTA cycle. 
When fulfilling requests, the operator enters 
identities and locations for labware and volumes 
and concentrations for solutions into a 
platform-generated spreadsheet for easy updating 
of the platform library. Platform requests 
are made immediately before starting 
uninterruptible tasks, as scheduling and predicting 
material needs across multiple parallel 
and mutable jobs is impractical. Therefore, for 
the platform to run at capacity, an operator is 
required to monitor the system. On rare occasions, 
the operator may need to rectify an error 
such as a robotic collision; these events are 
almost always traced back to human error, 
such as placing a piece of labware in the wrong 
location, but could be addressed with, for example, 
additional computer vision and advanced 
automated inventories. 


Chemprop model construction 


Models for predicting molecule properties were 
all built using Chemprop, a software package 
available at https://github.com/chemprop/ 
chemprop. For the larger dataset sizes of the 
absorption maximum and partition coefficient 
models, predictions were made using message- 
passing neural networks (MPNN) followed 
by conventional feed-forward networks. For 
small dataset sizes, random forest (RF) mod- 
els have comparable or better performance 
than MPNNs. Therefore, the photooxidative 
degradation rate model was implemented 
with an RF within the Chemprop environment. 
Uncertainty was estimated with the variance 
of predictions from an ensemble of Chemprop 
models. Models were hyperparameter opti- 
mized before beginning the discovery cam- 
paign (training parameters can be found in 
table S2). Chemprop automatically implements 
the different model architectures, ensembling 
for uncertainty estimation, and hyperparameter 
optimization. To demonstrate the rate of dis- 
covery, either 10-fold or leave-one-out cross- 
validation was performed (depending on the 
number of new data points), although in op- 
eration, validation is not required as newly 
synthesized molecules will not be in the train- 
ing data. 
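
The ensemble-based uncertainty estimate described above can be sketched as follows. This is a minimal illustration, not Chemprop's own API: the function name, the input format, and the numbers are hypothetical. Each ensemble member predicts the same property for the same molecule; the mean serves as the prediction and the variance across members as the uncertainty.

```python
from statistics import fmean, pvariance


def ensemble_prediction(member_predictions):
    """Return (mean, variance) across the predictions of an ensemble.

    member_predictions: one predicted value per ensemble member for a
    single molecule and property (hypothetical input format).
    """
    if len(member_predictions) < 2:
        raise ValueError("need at least two ensemble members")
    mean = fmean(member_predictions)
    # Population variance of the member predictions is used here as the
    # uncertainty estimate; a sample variance would also be a valid choice.
    variance = pvariance(member_predictions, mu=mean)
    return mean, variance


# Illustrative values only (not data from the study): five models
# predicting an absorption maximum in nm for one candidate.
mean_nm, var_nm2 = ensemble_prediction([512.0, 508.0, 515.0, 510.0, 505.0])
```

A larger variance flags a candidate on which the ensemble members disagree, which is the signal an experiment-selection step could weigh against the predicted property value.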


Candidate generation, reaction planning, 
and experiment selection 


Methods for candidate generation, reaction 
planning, and experiment selection are covered 
in the main text (Front-end predictions) and 
supplementary materials (SM1 to SM6). 


Reaction execution and workup 


To achieve the desired reaction throughput, 
reactions were run in parallel in 96-well plates. 
All well plates were prepared with a Tecan 
Freedom Evo 200 liquid handler equipped 
with a liquid-handling (LiHa) arm, a multi- 
channel arm (MCA), and a robotic manipu- 
lator (RoMa) arm that pipette individually 
addressed wells, pipette 96 wells in parallel, 
and move well plates, respectively. Addition- 
ally, workflows were developed for the RoMa 
to assemble and disassemble devices on the 
liquid handler bed, such as the Te-VacS micro- 
plate filtration system. Reaction groups taking 
place between 4° and 120°C were run on a 
Inheco Teleshake or Thermoshake on the Tecan 
Evo bed. To accommodate the dilution and 
decrease in material with each step in a multistep 
synthesis, the platform multiplies the single-step 
concentration (generally about 15 µmol) 
by the number of steps remaining (counting the 
current one), so the first reaction in a three-step 
reaction would be prepared at 45 µmol, the 
second step at 30 µmol, 
and the final step at the desired 15 µmol. For air- 
and water-sensitive reaction groups, reactions 
were run on a dedicated Teleshake with a re- 
movable cover (by the RoMa) that has (re- 
placeable) septa over each well and is constantly 
purged with nitrogen (supplementary mate- 
rials, SM12), allowing pipetting into wells and 
heating to reaction temperatures with mini- 
mal exposure to air. For well plate compatibil- 
ity, reactions taking place below 80°C were 
run in polypropylene well plates (1.8 mL work- 
ing volume) while those above 80°C were run 
in Para-dox aluminum well plates with glass 
inserts (0.8 mL working volume). A separate 
Precise Flex 400 collaborative robotic arm 
outside of the liquid handler was used to trans- 
fer well plates and resources between systems. 
Reactions beyond 120°C were run in a custom- 
made oven (supplementary materials, SM12) 
purged with nitrogen and sealed by means of 
an electric linear actuator uniformly pressing 
a fluorinated polymer film over an aluminum 
well plate with glass inserts (Para-dox Plate). 
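
The scale-up rule for multistep syntheses described above (45, 30, and 15 µmol for a three-step route) can be written as a short sketch. The function name and default value are assumptions made for illustration; the platform's actual implementation is described only in the supplementary materials.

```python
def step_scales_umol(n_steps, final_scale_umol=15.0):
    """Scale (in µmol) at which each step of an n-step route is prepared.

    Each step is prepared at the final-step scale multiplied by the
    number of steps remaining, counting the current step itself.
    """
    if n_steps < 1:
        raise ValueError("n_steps must be at least 1")
    # i = 0 is the first step, which has all n_steps still remaining.
    return [final_scale_umol * (n_steps - i) for i in range(n_steps)]


scales = step_scales_umol(3)  # [45.0, 30.0, 15.0]
```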

Crude reaction workup used the liquid han- 
dler to perform both extraction and filtration. 
For extraction, the target product is dissolved 
in chloroform and water is used to extract 
water-soluble impurities and salts. Use of 
chloroform ensures that the organic phase is 
on the bottom of the well and allows use of 
the MCA to extract all 96 wells at once with- 
out concern about the location of the boundary 
between the phases. Additionally, as chloro- 
form is easily evaporated away with mild 
heating it is used to transfer material between 
wells, allowing the material to be redissolved 
in the predicted solvent. This is especially 
useful during filtration, which uses 96-well 
polypropylene filter well plates (0.45 µm) 
with the Te-VacS microplate filtration sys- 
tem to collect filtrate from each well. Filtra- 
tion is required to prevent disruptions to HPLC 
operation. The reverse-phase HPLC system 
(water/acetonitrile gradient) cannot handle 
chloroform, so after filtration, an aliquot of the 
product is dissolved in dimethyl sulfoxide and 
the chloroform is evaporated. 

HPLC was performed with a Shimadzu 
Nexera series system equipped with a photo- 
diode array and mass spectrometer detectors 
and a well plate compatible autosampler. The 
mass spectrometer was used to identify tar- 
get products, which were isolated as (nearly) 
pure material by collecting the product peak 
with a well plate compatible fraction collector. 
With a semipreparative HPLC column, enough 
material could be collected to assay the new 
molecules’ properties of interest. As with the 
Tecan Evo, all operations of the Shimadzu 
Nexera were automated and coordinated by 
the master control network (supplementary 
materials, SM7). 


Property assays: absorption maximum, partition 
coefficient, and photooxidative stability 


Partition coefficients were determined with a 
calibrated HPLC retention time method (cal- 
ibration and fitting data are presented in sup- 
plementary materials, SM10) based on an 
initial screen of 30 common molecules using 
the same gradient used for analytic screening. 
Absorption spectra were measured with a 
Tecan Spark plate reader. Sample well plates 
were loaded into and unloaded from the Spark 
by the RoMa. Spectral measurements were ex- 
ported, and the raw and background-subtracted 
spectra were submitted to the experimental 
results database (control integration and data 
processing details can be found in supplemen- 
tary materials, SM11). Photooxidative stability 
was measured with the Spark in conjunction 
with a custom-built reactor attached to the 
Spark on its second door. The reactor was purged 
with heated air (40°C) and utilized a forklift to 
hold the well plate under constant illumina- 
tion, broad spectrum light from a xenon arc 
lamp (42 mW/m² average intensity). Periodically, 
the plate would be transferred back to the 
Spark for absorption spectra measurements. 
Using cumulative exposure time and estimated 
pristine compound spectrum tracking through 
spectral deconvolution (supplementary mate- 
rials, SM11), a first-order degradation rate con- 
stant was determined for each sample. 
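
As a hedged illustration of the final step, a first-order rate constant can be estimated by a linear least-squares fit of the logarithm of the remaining-compound signal against cumulative exposure time. The sketch below assumes the signal is already available as a positive number per time point; the spectral deconvolution that produces it in the study (SM11) is not reproduced, and the function name and synthetic data are hypothetical.

```python
import math


def first_order_rate_constant(times, signals):
    """Least-squares estimate of k in signal(t) = signal(0) * exp(-k * t).

    times: cumulative exposure times (any consistent unit).
    signals: positive values proportional to the remaining pristine
             compound at each time (hypothetical input format).
    """
    if len(times) != len(signals) or len(times) < 2:
        raise ValueError("need at least two matched (time, signal) points")
    logs = [math.log(s) for s in signals]
    t_mean = sum(times) / len(times)
    y_mean = sum(logs) / len(logs)
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    slope = sxy / sxx  # slope of ln(signal) versus time
    return -slope      # first-order decay: slope = -k


# Synthetic check: points generated with k = 0.05 per hour.
ts = [0.0, 1.0, 2.0, 4.0, 8.0]
ys = [math.exp(-0.05 * t) for t in ts]
k_est = first_order_rate_constant(ts, ys)
```

Fitting with an intercept, as done here, avoids relying on the first measurement alone as the reference signal.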


Automated model retraining 


Model retraining is automatically triggered 
after either a set period of time has elapsed 
or the platform completes a chemical discov- 
ery campaign. New data are appended to the 
initial training data and an ensemble of mod- 
els is trained according to the configuration 
file, originally configured during the initial 
Chemprop model construction. Given the rela- 
tively small increase to the dataset from each 
round of discovery campaign, hyperparame- 
ters were not re-optimized when retraining 
the models, maintaining fast model retraining 
times. Additionally, high-performance computing 
resources through MIT SuperCloud were used for rapid 
model retraining, and the updated models were 
returned to the prediction pipeline to start the 
next chemical discovery campaign. 
Are cryptocurrencies currencies? 
Bitcoin as legal tender in El Salvador 


Fernando Alvarez*, David Argente*, Diana Van Patten* 


INTRODUCTION: The introduction of digital cur- 
rencies is perhaps the most important develop- 
ment in monetary economics in the past decade. 
However, a currency’s defining role is to serve 
as a medium of exchange, and cryptocurrencies 
have yet to be widely adopted as such. This 
study leverages a unique quasinatural exper- 
iment that can shed light on the reasons behind 
this lack of adoption. El Salvador became the 
first country to make bitcoin legal tender; not 
only must bitcoin be accepted as a means of 
payment for taxes and debts, but also busi- 
nesses are required to accept bitcoin as a 
medium of exchange. The government also 
launched an app called “Chivo Wallet,” which 
allows users to digitally trade both bitcoins 
and US dollars (USD, the official currency in 
El Salvador) without paying transaction fees, 
and provided major adoption incentives such as 
a large bonus for downloaders. Moreover, the 
pandemic provided an additional incentive 
to adopt touchless payments; if bitcoin has a 
chance to be used as a medium of exchange, 
then this setting gave the cryptocurrency a prime 
opportunity. Furthermore, the study of Chivo 
Wallet, a digital currency backed by a central 
bank, is informative to the debate surrounding 
central bank digital currencies (CBDCs). 


RATIONALE: We conducted a nationally repre- 
sentative face-to-face survey involving 1800 
households in El Salvador and complemented 
its results with an analysis using all transactions 
identified as involving Chivo Wallet leverag- 
ing data from the blockchain. We explored 
whether Chivo Wallet and bitcoin were adopted 
after the government’s “Big Push,” what fac- 
tors deterred adoption by individuals and firms, 
and what insights can be obtained from block- 
chain data. We also analyzed the broader les- 
sons learned from this example. 


RESULTS: We found that bitcoin was not widely 
used as a medium of exchange and usage of 
Chivo Wallet was low. Most downloads took 
place just as the app was launched. Since then, 
adoption and remittances using Chivo Wallet 
have been decreasing over time. These results 
suggest that it is unlikely that the usage of 
bitcoin and Chivo Wallet will increase. Privacy 
and transparency concerns appear to be key 
barriers to adoption. We also documented that 
this technology involves a large initial adop- 
tion cost, has benefits that significantly increase 
as more people use it, and faces resistance from 
firms in terms of its adoption. These findings 
are relevant for countries studying the viability 
of CBDCs and of crypto as a currency. Further, 
our survey sheds light on how it is the already 
wealthy and banked who use crypto, which 
stands in stark contrast with recurrent hy- 
potheses claiming that the use of crypto may 
particularly help the poor and unbanked. An 
analysis relying on all blockchain transaction- 
level data from Chivo allowed us to validate 
and better understand our survey results and 
provided new insights on the dynamics of the 
use of Chivo Wallet. 


CONCLUSION: Despite bitcoin’s legal tender 
status and the large incentives to promote 
Chivo Wallet in El Salvador, the cryptocurrency 
was not adopted at large as a medium of ex- 
change, and digital payments were scarce and 
concentrated. These findings are informative 
about the intrinsic value of cryptocurrencies as 
means of payments and about the scope of 
CBDCs in developing countries. 
Are cryptocurrencies currencies? 
Bitcoin as legal tender in El Salvador 


Fernando Alvarez, David Argente*, Diana Van Patten* 


A currency’s essential feature is to be a medium of exchange. This study explores the potential of 
cryptocurrencies to be used in daily transactions in El Salvador, the first country to make bitcoin legal 
tender. The government's “big push” introduced “Chivo Wallet,” a digital wallet sharing features with 
Central Bank Digital Currencies (CBDCs), with perks to use it for trading bitcoins and US dollars. 
Through a nationally representative, face-to-face survey of 1800 households and blockchain data 
encompassing all Chivo Wallet transactions, we document a pattern of low and decreasing usage of 
digital payments and bitcoin. Privacy and security concerns are key adoption barriers, which speaks 
to a policy debate on crypto and CBDCs with anonymity at its core. Additionally, we estimate 
Chivo Wallet’s adoption cost and complementarities among adopters. 


In its first form money is simply any 
commodity ... which any person will 
readily receive, and which, therefore, 
every person desires to have by him in 
greater or less quantity, in order that 
he may have the means of procuring 
necessaries of life at any time. 

- William Stanley Jevons 


The introduction of digital currencies in 
general, and of cryptocurrencies in particular, 
is perhaps the most important 
development in monetary economics in 
the past decade. Cryptocurrencies such 
as bitcoin differ markedly from traditional 
banks. Bitcoin relies on cryptography for secu- 
rity and operates on a decentralized network 
with verifiable transactions, contrasting with 
centralized banks governed by regulations. For 
the unbanked and those reliant on remittances, 
bitcoin presents a potential solution by en- 
abling financial transactions, bridging the gap 
left by traditional banking systems. However, 
a currency’s key and defining role is to serve 
as a medium of exchange (1, 2), and cryptocurrencies 
have yet to be widely adopted for 
this purpose (3). 

This study leverages a unique quasinatural 
experiment that can shed light on the reasons 
behind bitcoin’s lack of adoption. On 7 Sep- 
tember 2021, El Salvador became the first 
country to make bitcoin legal tender through 
the “Bitcoin Law.” Legal tender refers to a 
form of payment that is recognized by law as 
valid for settling financial obligations within a 
particular jurisdiction. According to the Bit- 
coin Law, not only must bitcoin be accepted as 
a means of payment for taxes and outstanding 
debts, but also all businesses are required to 
accept bitcoin as a medium of exchange for all 
transactions (4). The Salvadoran government 
also launched an app called “Chivo Wallet,” a 
custodial wallet app that allows users to dig- 
itally trade both bitcoins and US dollars (USD) 
without paying any transaction fees. The gov- 
ernment also provided major adoption incen- 
tives, such as a large bonus for downloaders 
that could potentially solve the coordination 
failure, and also subsidized fees. Moreover, 
the COVID-19 pandemic provided an addi- 
tional incentive to adopt touchless payment 
methods. If bitcoin has a chance to be used in 
transactions as a medium of exchange, then 
this setting gave the cryptocurrency a prime 
opportunity. 

Furthermore, central banks are considering 
alternatives to enter the era of digital payments. 
Nine of 10 central banks are exploring central 
bank digital currencies (CBDCs), and more than 
half are developing them or running concrete 
experiments (5). A retail CBDC, a digital cur- 
rency backed by a central bank with legal ten- 
der status, shares many features with a fast 
payment system such as Chivo Wallet. More- 
over, because Chivo Wallet allows for pay- 
ments both in bitcoins and in USD, an analysis 
of its implementation is informative to the 
debate surrounding CBDCs, and a comparison 
between bitcoin and USD usage within the 
app is informative about the use of crypto in 
particular. 


Situating the current study and 
key contributions 


Unique monetary episodes can provide valu- 
able insights into the workings of the economy 
and inform future policy-making. Sargent’s 
seminal work on hyperinflations is a prime 
example of this research tradition (6). Our 
study of the Salvadoran experience follows in 
this tradition by studying an unprecedented 
monetary experiment in which bitcoin be- 
came legal tender and digital currency started 
being traded through Chivo Wallet. Our ex- 
amination shows that the designation of bitcoin 
as legal tender does not imply that it becomes 
a general medium of exchange as defined by 
previous work, i.e., an object “which is habit- 
ually, and without hesitation, taken by anybody 
in exchange for any commodity” (7). Important 
references in the literature argue that “accept- 
ability” makes an object more likely to become a 
medium of exchange and can be influenced by 
government policies (2, 8, 9), and that the state 
can give a currency value by allowing the public 
to use it to pay taxes (1, 10-13). We contribute to 
this long-standing work by documenting that 
accepting a digital currency to pay for taxes 
is not a sufficient condition for it to become 
widely accepted. 

Our work also contributes to the study of 
cryptocurrencies. Empirically, the literature 
has focused on the risks faced by individuals 
(14, 15), arbitrage opportunities and price 
manipulation (16, 17), bitcoin network participants 
(18, 19), bitcoin’s price fluctuations 
(3, 62), the determinants of asset pricing (20), 
and developing the notion that bitcoin seems 
to function more like a speculative investment 
than a bona fide currency (3). Our results pro- 
vide insights on the characteristics of adopters 
and the bottlenecks of adoption in a setting 
where incentives to adopt are high, fees are 
subsidized, and we have measurable variation 
in determinants such as income and finan- 
cial literacy. This study is also related to the 
growing theoretical literature on cryptocur- 
rencies, which has built models stressing the 
network effects of its adoption (8, 21, 22) 
and the cost of its production (23, 24). Com- 
plementary to these studies, our work quanti- 
fies the fixed costs of adoption along with the 
network effects. 

Further, through the study of Chivo Wallet 
payments in USD, we address the literature on 
CBDCs, in which empirical evidence is scarce 
(25, 26). As in the case of Chivo Wallet, recent 
policy briefs argue that CBDCs should not be 
bearer instruments (27). This is the case, for 
instance, for China’s CBDC (28), and is also the 
case of Chivo Wallet. Moreover, whereas Chivo 
Wallet is not backed by a central bank, it is 
backed by the government and is not required 
to be linked to a bank account, just as would 
be the case with a CBDC. Our work highlights 
several challenges to the implementation of 
CBDCs, such as the role of privacy and trans- 
parency concerns, while suggesting there is a 
role for policies that incentivize adoption given 
the presence of strong complementarities among 
adopters. More broadly, our study relates to 
work on the adoption of payment methods be- 
yond cash (29-32). 
Research questions 


RQ 1: Were Chivo Wallet and bitcoin actually adopted 
after the government's “big push”? 

RQ 2: What factors deterred the adoption of Chivo 
Wallet and bitcoin by individuals? 

RQ 3: What other broader lessons can be drawn 
from this experiment? 

RQ 4: What were the drivers of adoption by firms? 

RQ 5: What insights can be obtained from block- 
chain data? 


Methods 
The context 


El Salvador has been the stage for several monetary 
experiments. In 2001, the USD became 
legal tender and the country’s only official 
currency (33). Later, on 7 September 2021, 
El Salvador became the first country to make 
bitcoin legal tender through the Bitcoin Law. 
Although there might be many reasons be- 
hind the decision, when the policy was an- 
nounced, the president stated that it would 
generate jobs, provide financial inclusion, and 
facilitate sending remittances (34). In addition, 
bitcoin was seen as a mechanism to introduce 
people into financial services. For context, only 
one-third of the Salvadoran adult population 
had a bank account at a financial institution in 
2017 (35). Moreover, our survey’s background 
questions show that in El Salvador, most transactions 
are paid in cash; in fact, >50% of people 
only use cash, >70% of adults are unbanked, and 
almost 90% of them do not use mobile bank- 
ing, as reported in fig. S7 (36). We found that 
64.6% of Salvadorans have access to a mobile 
phone with internet, a prerequisite to adopt 
Chivo Wallet (37). 


The Bitcoin Law 


The first article of the Bitcoin Law describes 
its main objective and endows bitcoin with a 
legal tender status (38). It also makes bitcoin 
a unit of account within the country and, ac- 
cording to the theory of chartalism, endows it 
with value by accepting it as a means of pay- 
ment for tax purposes. The Bitcoin Law also 
goes beyond the usual provisions of a legal 
tender, making bitcoin a medium of exchange 
of mandatory acceptance nationwide. Article 
7 reads: “Every economic agent must accept 
bitcoin as payment when offered to him by 
whoever acquires a good or service.” Another 
relevant article of the law is related to how bit- 
coin usage will be implemented in the country. 
In particular, Article 8 mandates the govern- 
ment to provide the means to conduct trans- 
actions using bitcoin. How was the adoption 
of bitcoin facilitated and promoted by the 
state? The government’s answer was “Chivo 
Wallet” (39). 


The Chivo Wallet app 


Just as bitcoin became legal tender, the government 
launched Chivo Wallet and an educational 
campaign on how to use it. This digital 
wallet allows users to convert bitcoins into USD 
and vice versa without a fee and to send or 
receive either currency (40). As shown in fig. 
S4, payments are made through the application 
by entering the recipient’s identification num- 
ber or phone number and the payment amount 
(41). The app can also be used to pay at local 
establishments, is compatible with other bit- 
coin on-chain and Lightning wallets, and con- 
nects with El Salvador’s banking system to 
deposit or withdraw USD from a bank account 
(42). Chivo Wallet can be used by registered 
Salvadorans even if they reside abroad to fa- 
cilitate sending remittances potentially faster 
and at a lower cost than alternative services. 
Chivo Wallet also has a version intended for 
firms, which allows them to charge their cli- 
ents and pay taxes. It does not provide users 
with the key to their bitcoin, which makes it 
a “custodial” wallet in which transactions are 
not anonymous; users are required to enter 
their personal information after downloading 
the app, just as in the case of several CBDCs 
(27, 28). 


Adoption incentives 


Usage of bitcoin in El Salvador is related to 
Chivo Wallet’s adoption, and as an adoption 
incentive, citizens who downloaded the app 
could receive a $30 bitcoin bonus from the 
government, which is a substantial amount 
in this Central American country with a GDP 
per capita of $4131 (43). These $30 bonuses 
were automatically deposited in their wallets; 
however, the money could not be withdrawn 
as cash before first being transferred to an- 
other Chivo Wallet because the bonus was in- 
tended to promote bitcoin usage. As another 
government incentive, users could get a signif- 
icant discount on gasoline if they paid using 
Chivo Wallet (44). Moreover, transactions in 
bitcoin usually involve substantial fees. For in- 
stance, bitcoin ATM fees can range from 5% 
to over 20%, with an average of about 8.5%, 
and transactions in bitcoin reached a fee of 
>$60 USD per transaction in April 2021 and 
an average value of $1.8 USD in February 2022. 
Transactions in bitcoin and conversions from 
bitcoins to USD using Chivo Wallet and cash 
withdrawals at Chivo Wallet ATMs do not 
incur any fees. This can be interpreted as an 
additional government subsidy. In El Salvador, 
payments of public salaries and pensions re- 
main in USD. Allowing for these payments 
to be in bitcoins could have provided another 
adoption incentive (45). 


Bitcoin in other countries 


The lack of access to banking services and 
infrastructure increases the potential of digi- 
tal payments to promote financial inclusion. 
Consistent with this, most of the top 20 coun- 
tries in the 2021 Global Crypto Adoption Index 
are emerging economies. The Central African 
Republic (CAF) was the second country, after 
El Salvador, to make bitcoin legal tender in 
April 2022, the same month in which Panama 
approved its own Crypto Law (46). High-income 
countries have not been absent from the crypto 
stage. For instance, an Arizona senator proposed 
a bill to make bitcoin legal tender in that state 
in January 2022 (47). 


Measures 


Amid growing interest among monetary authorities 
in promoting digital currencies, 
El Salvador offers a rare opportunity to 
learn about the potential of cryptocurrencies 
to become a widely used payment method. 
However, access to data poses a challenge because 
El Salvador’s government reveals only 
selected information (48). To overcome this 
challenge, we conducted surveys to generate 
data that would be otherwise unobtainable. 
This allowed us to measure the adoption of 
respondents based on their characteristics, 
focusing not only on downloads but also on 
usage. 

The survey was face-to-face, nationally rep- 
resentative, and spanned 1800 households 
during February 2022 (49), yielding a 1.94% 
margin of error at the 95% confidence level. 
Respondents were all >18 years 
old, as this is a prerequisite to be eligible to use 
Chivo Wallet. The national survey was con- 
ducted in partnership with CID-Gallup (50). 
Interviewers were trained a week in advance 
to conduct the survey, and we implemented a 
pilot interviewing 50 people to ensure that 
survey questions were clear. Our sample validation 
can be found in table S2; the sample 
almost exactly matches total population shares 
in terms of gender, age, districts, and educa- 
tion levels. 

The sample is also representative in terms 
of bank account ownership (51). The national- 
scale and face-to-face nature of the survey 
posed a challenge compared with an internet 
or phone survey. However, both features are 
important in our setting. First, understanding 
adoption patterns requires a sample that in- 
cludes small cities and rural areas; focusing 
on main population centers may bias results. 
Second, because bitcoin’s adoption through 
Chivo Wallet requires access to both a cell 
phone and an internet connection, a survey by 
phone or internet, which relies on respondents 
having access to either communication 
method, would mechanically underestimate 
adoption costs. Finally, the face-to-face format 
of our survey preserves data quality while 
allowing us to conduct a longer survey with 
more detailed questions than would be fea- 
sible through the phone or internet (52). The 
survey measures sociodemographic varia- 
bles, knowledge about Chivo Wallet, down- 
loads of the app, and usage both in bitcoins 
and USD. We include details on the specific 
questions in the supplementary materials, 
section D. 

We complement our survey results with an 
analysis using all transactions identified as 
involving Chivo Wallet, leveraging data from 
the blockchain, a distributed public ledger. We 
not only studied overall volumes transacted 
through Chivo Wallet, but separately analyzed 
the patterns of deposits and withdrawals, and 
identified consistencies between the survey 
outcomes and the blockchain results. 


Results 

RQ 1: Were Chivo Wallet and bitcoin actually 
adopted after the government’s “big push”? 
Awareness 


We found that 68% of potential users knew 
about the app’s existence. Most of those who 
were aware of the app learned about it through 
social media, followed by television and radio, 
news, and friends and family, as summarized 
by fig. S8. Almost 78% of those who were aware 
of the app had tried to download it. Most 
downloads happened just as Chivo Wallet was 
launched. Figure 1A shows that 40% of all 
downloads occurred in September 2021 and 
there were virtually no downloads in 2022. The 
latter suggests that our survey was already 
capturing the most relevant share of adopters 
of this digital wallet. 

In terms of heterogeneity, we found that 
banked, educated, and young men were more 
likely to know about Chivo Wallet (table S3), 
as were people who owned a cell phone with 
internet. Moreover, conditional on knowing 
about Chivo Wallet, these characteristics also 
make a person more likely to try to adopt it, as 
documented in Table 1 (53). People with these 
demographics also tended to download the app 
on their own without help (table S4). These 
findings suggest that the introduction of Chivo 
Wallet mainly provided an additional means 
of payment among those already banked instead 
of stimulating more financial inclusion 
among the unbanked. 

[Fig. 1A. Timing of adoption: monthly downloads as a share of total downloads (y-axis: share of adopters).] 

Not all users agreed with the widespread 
use of Chivo Wallet. Individuals who agreed 
tended to own a mobile phone with internet, 
and were younger and male. Columns 1 to 3 in 
table S5 show that people who agreed with the 
use of Chivo Wallet were 0.3 percentage points 
more likely to download the app, and columns 
4 to 6 show that individuals who were less 
likely to agree also tended to be those who 
needed help installing the app. 


Reasons to download Chivo Wallet 


The key incentive for downloading the app was 
the $30 bonus, which is equivalent to 0.7% of 
annual income per capita. Other reasons deemed 
as the most important were the contactless 
nature of the payment method in the midst 
of the pandemic and the potential to receive 
remittances; fig. S11 summarizes all reasons 
regarded as most important. 


Chivo Wallet usage by households 


Most respondents spent their $30 bonus to 
pay for expenses in bitcoins, and almost 20% 
of those who downloaded the app had not yet 
used their bonus (54). However, most users 
did not keep using Chivo Wallet after spend- 
ing their bonus. Table 2 presents descriptive 
statistics on Chivo Wallet’s usage among those 
who downloaded it and who reported using
the app after spending the bonus. A salient feature
of people who downloaded Chivo Wallet
and kept using it after spending their bonus 
is that they were more likely to be young, edu- 
cated, male, banked, and much more likely (26%) 
to be using other digital wallets in addition to 
Chivo Wallet to conduct transfers (55). Dis- 
tance to a Chivo Wallet ATM and facing issues 
with the app, however, were not good predic- 
tors of whether the user remained active, sug- 
gesting that these were not the binding barriers 
to sustain usage (56). 

More than half of these “active users” had not 
made a cash withdrawal from a Chivo Wallet 
ATM, although the mean number of withdraw- 
als was 2.59, given the presence of extreme 
values in the right tail (57). The number of 
payments and transfers received or sent was 
also largely driven by very active users in the 
right tail. Deposits in USD is the only statistic 
in which users in the 25th percentile have a 
nonzero value. We can conclude that active 
Chivo Wallet users transact in USD more than 
bitcoins, because the average amount of pay- 
ments and transfers, sent or received, was 
consistently larger in USD. 


Regional variation 


Figure 1B shows important regional variation 
in the probability of downloading Chivo Wallet 
depending on the share of unbanked popula- 
tion in each department. It also benchmarks the 
CAF, the second country to make bitcoin legal 
tender, and Panama, which enacted a crypto 
law in April 2022, with respect to departments 
in El Salvador given our estimates and their 
share of unbanked. Figures S18 and S19 also 
show regional differences in adoption and 
Fig. 1. Chivo Wallet’s adoption. (A) Timing of adoption: Monthly downloads. (B) Regional variation in adoption as a share of total downloads by share of unbanked
population. (A) shows the month in which each user in our sample downloaded Chivo Wallet as a share of total downloads. (B) shows the relationship between
the share of people who have tried to use Chivo Wallet and the fraction of people who do not have access to a bank account in El Salvador, by department. (B) also
includes, for comparison, the shares of unbanked in Panama and the CAF.
Table 1. Adoption of Chivo Wallet. The dependent variable was “have you tried to download Chivo Wallet?”


(1) (2) (3) (4)
Cell phone with internet
0.0757** (0.023)
0.0849***
0.1168***
-0.0969*
-0.1349*** (0.021)
Department Yes Yes Yes Yes


The sample only includes respondents who knew about the existence of Chivo Wallet. Results in this table rely on a linear
probability model. Results are robust to other specifications, in particular, columns (1) and (3) of table S12 show the marginal
effects under a logit model. The regression includes department fixed effects, and each of the controls is obtained from
survey questions on whether a person owns a financial instrument (unbanked), years of schooling, age, gender, and marital
status. Standard errors are clustered by department.


awareness about Chivo Wallet depending both 
on the average income and the share of un- 
banked per “department,” with departments 
being similar to counties. Regions with higher 
levels of development tended to be more active 
using Chivo Wallet. The share of users who 
continued using the application after spend- 
ing the $30 USD bonus in departments such 
as San Salvador and La Libertad, which have 
the highest income per capita in the country, 
was twice as large as in departments with 
low income per capita, such as Usulutan and 
Chalatenango (58). Similarly, departments 
with a larger share of unbanked population 
had as little as half the adoption levels as de- 
partments in which most of the population 
had access to banking services. 

Along similar lines, assuming that the im- 
plementation of a digital wallet is similar in 
other contexts, our estimates allow us to ex- 
plore how other features of adoption would 
manifest in other countries, which could prove 
valuable to policy makers. Given our estimates, 
in the CAF, only 37 to 45% of the population 
would have been aware of the app’s existence, 
8 to 14% would continue using the app given 
similar adoption incentives as in El Salvador,
and <2% would use the app for remittances. In 
the case of Panama, income per capita is higher 
than in El Salvador, as is access to banking 
services. We estimate that >95% of the adult 
population in Panama would be aware of the 
technology, between 30 and 56% would con- 
tinue using it after spending the adoption 
incentives, and 10 to 30% would use it for re- 
mittances. The last two estimates are cut in 
half when considering payments in bitcoins in 
either country. 


Role of taxes and remittances 


By law, bitcoin can be used to pay taxes. 
Chartalism implies that endowing a currency 
with the power to pay taxes gives it value as 
a means of exchange. However, only 5% of 
Salvadorans have paid taxes using Chivo Wallet. 
Moreover, in El Salvador, some households 
receive >60% of their income from remit- 
tances, as summarized in fig. S15. Chivo Wallet 
is not widely used to receive remittances from 
abroad; only 3% (8%) of people have received 
remittances in bitcoins (USD) using the app. 
This finding aligns with reports from the 
Central Reserve Bank of El Salvador, which
found that only 1.45% of remittances were 
received through digital wallets in March 
2022, and provides external validation to our 
survey (59). 


RQ 2: What factors deterred the adoption of 
Chivo Wallet and bitcoin by individuals? 
Chivo Wallet adoption deterrents 


More than 21% of respondents knew about 
Chivo Wallet but did not try to download it. 
The reasons not to download it are summar- 
ized in fig. S13A. The most important reason 
was that users preferred to use cash. The second
most relevant reason not to download
Chivo Wallet was trust issues: Respondents
did not trust the system or bitcoin itself (60). 
Privacy and security are at the heart of the 
debate around CBDCs and bitcoin. Concerns 
regarding lack of anonymity and secure trans- 
actions could then explain, at least partially, 
the main two reasons not to download the 
app, because cash is an anonymous payment 
method (61). The next most frequent reason
mentioned was not owning a phone with in- 
ternet, followed by the technology being com- 
plicated. In sixth place, Salvadorans mentioned 
technical difficulties using the app; fig. S14 
summarizes the main reported problems. 


Bitcoin adoption deterrents 


Figure S13B reports the main reasons why individuals
do not use bitcoin. For >50% of respondents,
the main reason not to use bitcoin
was that they did not understand or trust
it. Although the volatility of bitcoin is a potential
deterrent (62), trust and transparency
seem to be more salient than uncertainty, because
bitcoin’s volatility was mentioned by <10%
of respondents. If volatility were the main deterrent
to using Chivo Wallet, then we
should see people downloading the app
and transacting in USD, which is very stable;
however, this was not the case, as explained in 
the previous paragraph. 


Taking stock 


Figure 2A summarizes results from our first 
two research questions. We documented that 
more than two-thirds of Salvadorans were 
aware that Chivo Wallet exists. However, not 
all of those who knew about the app have tried 
to download it; just over half of all respond- 
ents did so. The main reason not to download 
Chivo Wallet was that individuals prefer to pay 
in cash, followed by mistrust; these motives
may be related to privacy concerns. The main 
reason that they downloaded the app was to 
use the $30 bonus offered by the government, 
but less than half of those who were able to 
download Chivo Wallet, 20% of adult citizens, 
continued to use it after spending the bonus 
and they mostly used it to transact in USD, not 
in bitcoins. 
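
The shares summarized above can be cross-checked with simple arithmetic. The following Python sketch is not part of the survey analysis; it only combines the rounded point estimates quoted in the text (68% aware, almost 78% of those aware trying to download, and 20.6% of respondents still using the app after the bonus) to recover unconditional shares. The function and variable names are illustrative assumptions.

```python
# Rounded point estimates quoted in the text (shares, not percentages).
AWARE = 0.68              # knew Chivo Wallet existed, share of all respondents
TRIED_GIVEN_AWARE = 0.78  # tried to download it, among those who were aware
KEPT_USING = 0.206        # kept using it after the bonus, share of all respondents


def adoption_funnel(aware, tried_given_aware, kept_using):
    """Turn the conditional shares into shares of all respondents."""
    tried = aware * tried_given_aware      # aware AND tried to download
    aware_not_tried = aware - tried        # aware but never tried
    kept_given_tried = kept_using / tried  # retention among downloaders
    return {
        "tried_to_download": tried,
        "aware_but_did_not_try": aware_not_tried,
        "kept_using_given_tried": kept_given_tried,
    }


funnel = adoption_funnel(AWARE, TRIED_GIVEN_AWARE, KEPT_USING)
for name, share in funnel.items():
    print(f"{name}: {share:.1%}")
```

With these rounded inputs, the implied download share is about 53% of respondents, roughly 15% were aware but did not try to download, and retention among downloaders is below one-half, which is close to the 53.5%, 14.5%, and "less than half" figures reported in the article.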




Table 2. Descriptive statistics: Active Chivo Wallet users.

(1) Mean (2) SD (3) 10th (4) 25th (5) Median (6) 75th (7) 90th
Average amount of ATM withdrawals (in USD) 54.9 65.6 10 20 30 60 120
Average amount of payments/transfers sent in bitcoin (in USD) 328)
nsfers received in bitcoins
unt of payments/transfers received in bitcoin (in USD)
Deposits in USD 44 13.8 0 0) il 2 10


The table shows distribution of responses to the questions: (i) How many times per month do you withdraw money from Chivo Wallet ATMs?; (ii) What is the average amount of your ATM 
withdrawals?; (iii) How many payments or transfers do you perform per month using Chivo Wallet in bitcoins or in USD?; (iv) What is the average amount of your payments or transfers in bitcoins 
or in USD?; (v) How many payments or transfers did you receive per month using Chivo Wallet in bitcoins or in USD?; (vi) What is the average amount of your payments or transfers you received 
in bitcoins or in USD?; and (vii) How many times have you deposited money to your Chivo Wallet in bitcoins or in USD? We divided the number of deposits by the months a person was active in 
Chivo Wallet to convert them to a monthly variable and rounded the values to the closest integer. The sample includes those who kept using Chivo Wallet after spending their $30 bonus (20.6% of 
respondents). We dropped observations above the 99th percentile to avoid extreme outliers. 
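
The processing steps described in the table note (converting totals to monthly values, rounding, and trimming above the 99th percentile) can be sketched as follows. This is a minimal illustration rather than the authors' code: the nearest-rank percentile rule, the function names, and the toy data are all assumptions, and Python's built-in `round` resolves exact ties to the nearest even integer, which may differ from the rounding rule used for the table.

```python
import math
import statistics


def nearest_rank_percentile(values, pct):
    """Smallest observed value with at least pct% of the data at or below it."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def to_monthly(total_count, months_active):
    """Convert a total count to a per-month count, rounded to an integer."""
    return round(total_count / months_active)


def describe(values, trim_pct=99):
    """Drop observations above the trim percentile, then summarize the rest."""
    cutoff = nearest_rank_percentile(values, trim_pct)
    kept = [v for v in values if v <= cutoff]
    return {
        "n": len(kept),
        "mean": statistics.mean(kept),
        "sd": statistics.stdev(kept),
        "p10": nearest_rank_percentile(kept, 10),
        "p25": nearest_rank_percentile(kept, 25),
        "median": statistics.median(kept),
        "p75": nearest_rank_percentile(kept, 75),
        "p90": nearest_rank_percentile(kept, 90),
    }


# Toy data (hypothetical): 99 ordinary responses plus one extreme outlier.
responses = list(range(99)) + [10_000]
summary = describe(responses)
print(summary["n"], summary["mean"], summary["median"])
```

In the toy data, the single extreme response is dropped before the mean and percentiles are computed, which is the purpose of the trimming step: a few very active users would otherwise dominate the mean and standard deviation.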


Fig. 2. Taking stock. (A) Awareness and individual use. (B) Acceptance and use of bitcoin with Chivo Wallet among firms. (A) shows shares with respect to the entire
sample, so it is subject to a 1.94% margin of error. In (B), the top two bars show percentages with respect to all surveyed owners and employees who knew about 
payment methods at the firm. The bottom two bars show percentages with respect to total sales. 


Moreover, most individuals who used Chivo 
Wallet after spending the bonus did not engage 
with the app intensively; the median user re- 
ported no ATM withdrawals and no payments, 
sent or received, in bitcoin in a given month. To 
put this in perspective, the median number of 
daily transactions per person across means of 
payments was between 1.3 and 14 in several 
countries (63), and Chivo Wallet’s developer 
indicates that there are 0.001 to 0.003 daily 
transactions per adult (64). Further, we did 
not find evidence of Chivo Wallet being used 
to pay for taxes or to send remittances at a 
substantial scale. Figure S20 replicates Fig. 2A 
for the share of the banked and unbanked 
population, respectively. 
Overall, we have documented that bitcoin is
not being widely used as a medium of exchange
and that Chivo Wallet’s usage is low in
El Salvador. The latter stands despite the “big
push” exerted by the government, which in- 
volved endowing bitcoin with legal tender status 
through the Bitcoin Law, the $30 bonus, gas 
discounts, and no fees, and despite the pan- 
demic’s incentive to use touchless payment 
methods. 


RQ 3: What other broader lessons can be drawn 
from this experiment? 
Complementarities 


For some technologies, the benefit of adopting
increases as more people adopt (65). Arguably,
such complementarities, also called
network externalities, are an inherent feature 
of digital payment methods and give a po- 
tential role for policy to improve allocations 
(66). Thus, we can draw broad lessons appli- 
cable to other payment technologies from the 
analysis of Chivo Wallet. We found evidence 
of complementarities, both in the decision 
to adopt the app and on how intensively peo- 
ple used it, as reported in the supplementary 
materials, section C. 


Adoption and variable costs 


We leveraged the familiarity with the $30 in- 
troductory bonus and asked two questions 
to estimate the distribution of (self-reported) 


5 of 9 


RESEARCH | RESEARCH ARTICLE 


adoption costs. The first question was: “How 
large does the bonus need to be to convince 
you to download Chivo Wallet?,” which was 
directed to people who had not downloaded 
the app, but knew about it (14.5% of respon- 
dents). The second question was: “What is the 
minimum bonus that would have convinced 
you to download Chivo Wallet?,” which was 
directed to people who had downloaded the 
app (53.5% of respondents). Table S7 displays 
our results. Although the mean reported val- 
ue was $30, the median user would have ac- 
cepted $20 USD, and there were people in 
the 10th percentile who would have adopted 
it even without a bonus. The adoption cost was 
larger for individuals with certain demographics:
Unbanked respondents reported a $6.9 USD
higher cost than those who were banked; people
without a cell phone with internet reported
an $8.6 USD higher cost than those with one;
households with only elementary education reported
a $2.9 USD higher cost than
those with education beyond elementary;
and women reported an $8.9 USD
higher cost than men.

Chivo Wallet allows users to withdraw cash 
from Chivo Wallet ATMs and convert bitcoin 
into USD without a fee. However, outside of 
Chivo Wallet, most providers charge significant 
fees. Table S8 shows that the maximum reported
willingness to pay to withdraw $100 USD at a
Chivo Wallet ATM was $3.3 USD on average.
This amount is less than half of the mean fee
to purchase cash at bitcoin ATMs outside of
El Salvador. Moreover, the median respondent
was willing to pay only $1 USD. These findings 
suggest that Chivo Wallet users would not en- 
gage in cash withdrawals if they faced non- 
subsidized fees. Table S8 also reports that the 
average willingness to pay to convert bitcoins 
into USD was $2.9 USD, and the median user 
would be willing to pay only $0.05 USD. These 
amounts are much smaller than any transac- 
tion cost of exchanges, indicating that Chivo 
Wallet would not be used in the absence of the 
subsidies. 


Impact on usage of other payment methods 


If users adopt Chivo Wallet, then they might 
substitute it for other payment methods such 
as cash and cards. We found some evidence 
consistent with this substitution. We docu- 
mented that 10% of users who downloaded 
Chivo Wallet decreased their use of cash and 
11% reduced their use of debit cards (67). The 
government also offered a discount of ~8% per 
gallon for gas purchases with Chivo Wallet, 
which allowed us to estimate the elasticity of
substitution, that is, how easily people
switch between Chivo Wallet and other payment
methods, as detailed in the supplementary
materials, section C. Although the sample size is 
small, the estimated elasticities of substitution 
are positive and large, which suggests that the 
welfare costs of policies disincentivizing other
payment methods (such as cash) are lower if 
digital payments are available. 
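
For reference, a textbook definition of the elasticity of substitution between two payment methods is given below. This is a generic sketch under stated assumptions, not necessarily the specification estimated in the supplementary materials: x_C and x_O denote spending through Chivo Wallet and through other payment methods, and p_C and p_O the effective prices paid with each. All four symbols are notation introduced here for illustration, and prices are assumed to be equal before the discount.

```latex
\sigma \;=\; \frac{d \ln\left(x_C / x_O\right)}{d \ln\left(p_O / p_C\right)},
\qquad
\Delta \ln\left(\frac{p_O}{p_C}\right) \;=\; -\ln(1 - 0.08) \;\approx\; 0.083 .
```

Under this definition, the ~8% gas discount raises the relative price of other payment methods by about 8.3 log points, and a large positive elasticity means that the share of purchases made with Chivo Wallet responds strongly to that change.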


RQ 4: What were the drivers of adoption
by firms?

The Bitcoin Law states that all economic agents 
must accept bitcoin, but this does not neces- 
sarily translate into all firms effectively doing 
so (68). To study the extent to which firms ac- 
cepted bitcoin, we relied on a subset of re- 
spondents who identified themselves as owners 
of firms or as employees who knew about the 
payment methods accepted by their employer, 
who then answered a series of questions about 
their business. Results are summarized in Fig. 2B. 

First, we documented that whereas almost 
all firms accepted cash, slightly over 20% ac- 
cepted bitcoin (69). Among those that did 
accept bitcoin, 75% started accepting it just 
as the law was enacted. Only 11.4% of firms 
had positive sales in bitcoin. This estimate 
aligns with results from two independent sur- 
veys targeting firms of all sizes and across 
sectors (70). Further, our survey indicates that 
81% of firms accepting bitcoin have not seen 
a change in their sales since starting to accept 
it, and whereas the median firm made no sales 
in bitcoin, 4.9% of all sales were paid in bitcoin 
through Chivo Wallet, mainly to large firms. 
These estimates align with those of two independent
local surveys (71).

Second, we documented that firms accept- 
ing bitcoin were mostly large and in the fifth 
quintile of the firm size distribution (72). These 
large firms were also more likely to accept cards. 
Third, most firms reporting sales in bitcoin 
converted them into USD: 71% converted sales 
into USD and then withdrew them as cash, 
17% converted sales into USD and kept them 
in Chivo Wallet, and only 12% of firms stored 
their sales in bitcoin within Chivo Wallet. Fi- 
nally, we found that 11% of firms have increased 
prices since bitcoin became legal tender, which 
is consistent with the hypothesis that firms 
might be transferring costs related to the crypto- 
currency (e.g., volatility) to customers (73). 


RQ 5: What insights can be obtained from 
blockchain data? 


So far, the conclusions discussed have been
drawn from the survey data that we collected.
This section leverages the fact that all bitcoin transactions
are recorded on the blockchain, a distributed
public ledger, to analyze Chivo Wallet’s
activity based on actual transaction data. This 
exercise allowed us to validate and better un- 
derstand our survey results. The analysis using 
transaction-level data for all of Chivo Wallet’s 
transactions on the blockchain also provides 
new insights into how bitcoin transactions are 
carried out in El Salvador and by whom. Data 
sources are detailed in the supplementary ma- 
terials, section E. 
It is important to understand which Chivo
Wallet transactions would surface on the blockchain
and which ones would not. As of today,
verifying a bitcoin transaction on the blockchain
is costly and takes several hours
(74). Given this constraint, many wallets that 
use bitcoin for relatively small payments do 
not verify all transactions on the blockchain. 
Instead, they are custodial wallets and rely 
on a clearing house. Chivo Wallet is no excep- 
tion; therefore, transactions from one Chivo 
Wallet to another one, in general, would not 
register on the blockchain. Transactions be- 
tween different addresses owned by Chivo 
Wallet as an entity do appear on the blockchain, 
and we label them as internal transactions (75). 
Transactions from Chivo Wallet to external 
crypto wallets also surface in the public ledger. 
These would include, for example, payments 
from tourists visiting El Salvador and paying 
in bitcoin from their foreign wallets. 
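
The visibility rules described above can be summarized in a short Python sketch. It is a simplification under stated assumptions: each on-chain transaction is reduced to a single sender and a single receiver address (real bitcoin transactions can have several inputs and outputs), the set of addresses attributed to Chivo Wallet is taken as given, and the labels and function names are hypothetical rather than the authors' own. Transfers between two Chivo Wallet users are cleared off-chain and therefore never enter this kind of data.

```python
from collections import Counter


def classify(sender, receiver, chivo_addresses):
    """Label an on-chain transaction relative to Chivo Wallet's addresses."""
    from_chivo = sender in chivo_addresses
    to_chivo = receiver in chivo_addresses
    if from_chivo and to_chivo:
        return "internal"    # between two addresses owned by Chivo Wallet
    if to_chivo:
        return "deposit"     # external wallet -> Chivo Wallet
    if from_chivo:
        return "withdrawal"  # Chivo Wallet -> external wallet
    return "unrelated"       # does not involve Chivo Wallet at all


# Hypothetical addresses and transactions, for illustration only.
chivo = {"chivo_a", "chivo_b"}
transactions = [
    ("chivo_a", "chivo_b"),
    ("tourist_1", "chivo_a"),
    ("chivo_b", "exchange_1"),
    ("tourist_1", "exchange_1"),
]
counts = Counter(classify(s, r, chivo) for s, r in transactions)
print(dict(counts))
```

Only the first three categories would be attributed to Chivo Wallet in an analysis of this kind; the "deposit" and "withdrawal" labels correspond to the external transactions, and "internal" to movements between the wallet's own addresses.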

According to our data, as of 3 November 2022, 
Chivo Wallet was associated with 142,148 ad- 
dresses, which were involved in 425,514 trans- 
actions and a total of 3,424 bitcoins deposited 
into Chivo Wallet. These are all the transac- 
tions that can be identified as involving Chivo 
Wallet either as a buyer or a seller of bitcoin. 
Figure 3 summarizes some of the observed 
dynamics. As shown in Fig. 3A, the total trans- 
actions in bitcoins, expressed in USD, reached 
their peak between October and December 
2021 and decreased significantly thereafter. 
The latter is consistent with the results of our 
survey, which document high activity within 
the first months of Chivo Wallet’s operation 
and a sharp decrease thereafter. 

Figure 3A shows all activity, whereas Fig. 3B 
considers only external transactions and de- 
composes them as total deposits into and 
withdrawals from Chivo Wallet (76). First, the 
co-movement between both types of external 
transactions was substantial. Second, an anal- 
ysis of the average size of each type of transac- 
tion, reported in fig. S24, shows that deposits 
were composed of many small and relatively
frequent transactions; for example, these could
be transactions from tourists visiting El Salvador
to use bitcoin or residents of El Salvador
who had bitcoin in other wallets (77). Their
active behavior resembles that of the right
tail of Chivo Wallet users who were extremely
active, as documented in table S6. The magni- 
tude of inflows of bitcoin in the survey and on 
the blockchain data also align. According to 
our survey, between $221,000 and $334,000 
USD flowed into Chivo Wallet per day, where- 
as according to blockchain data, this amount 
was ~$245,000 USD per day (78). Third, a joint 
analysis of Fig. 3B and fig. S24 shows that with- 
drawals (i.e., sales of bitcoin by Chivo Wallet) 
tended to be large and happen rarely, and in 
synchrony, with the pace of accumulated de- 
posits. This pattern suggests that withdrawals 
Fig. 3. Chivo Wallet’s blockchain transactions. (A) Total transactions, both internal and external. (B) External deposits and withdrawals (in USD). (A) shows the
total number of transactions in Chivo Wallet, including internal transactions and external withdrawals and deposits in USD. We converted bitcoin's value into USD 
because otherwise the patterns would also reflect bitcoin’s price changes, which were substantial in this period. (B) shows the dynamics of external withdrawals and 
deposits. The vertical dashed lines date moments when El Salvador’s government announced a bitcoin purchase. 


occurred as part of Chivo Wallet’s bitcoin inventory
management, such that Chivo Wallet
accumulated balances of bitcoin to lower the 
transaction cost of selling them. This behavior 
is consistent with the almost zero net accu- 
mulation of bitcoin within the wallet shown in 
fig. S25. It also resembles the behavior displayed
by firms in El Salvador, which tended
to convert all of the bitcoin they received into 
USD almost immediately. Finally, the data 
verify that trading volumes in Chivo Wallet are 
uncorrelated with bitcoin prices; thus, Chivo 
Wallet trading volumes seem to be driven by 
idiosyncratic reasons rather than by bitcoin 
market prices. Section E of the supplementary
materials provides more details on this
relationship.


Discussion 


Following the tradition of studying unique 
monetary episodes to inform policymaking, our 
analysis of the Salvadoran experience with
bitcoin as legal tender offers valuable insights 
into the complexities of the adoption of crypto- 
currencies as a medium of exchange and the 
implementation of CBDCs. El Salvador’s govern- 
ment provided a “big push” to incentivize the 
use of digital payments and bitcoin, including 
a large sign-up bonus and subsidized fees. 
Bitcoin is not only endowed with legal tender 
status, allowing the currency to be used to pay 
taxes and debts, but also must be accepted by 
any economic agent by law. Monetary theories 
such as chartalism suggest that these condi- 
tions should be sufficient for bitcoin to be- 
come a medium of exchange. 

However, our results show that, despite all 
incentives and the enhanced attractiveness of 
contactless payments in the midst of the pandemic,
bitcoin is not being widely used as a
medium of exchange and usage of Chivo Wallet 
is low. Most downloads took place just as Chivo 
Wallet was launched. Since then, adoption and 
levels of remittances through Chivo Wallet have 
been decreasing over time. These results sug- 
gest that it is unlikely that usage of bitcoin and 
Chivo Wallet will increase. Our empirical results 
challenge the implications of chartalism. 

Privacy and transparency concerns appear 
to be key barriers to adoption; unexpectedly, 
these are the two concerns that decentralized 
currencies such as crypto aim to address. More- 
over, we document that this payment technol- 
ogy involves a large initial adoption cost, has 
benefits that significantly increase as more 
people use it (i.e., complementarities), and faces
resistance from firms in terms of its adoption. 
Our findings lay out the challenges faced by 
digital payments and cryptocurrencies to be- 
come widely accepted, and are relevant for 
countries studying the viability of CBDCs and 
of crypto as a currency. Moreover, our survey 
work using a representative sample sheds light 
on how it is the already wealthy and banked 
who use crypto, which stands in stark contrast 
with recurrent hypotheses claiming that the 
use of crypto may help the poor and unbanked 
the most. 

There is substantial heterogeneity across dem- 
ographic groups in the likelihood of adopting 
and using bitcoin as a means of payment. The 
reasons that young, educated men are more 
likely to use bitcoin for transactions remain 
an open question. One hypothesis is that this 
group has higher financial literacy. We found 
that, even conditional on access to financial 
services and education, young men were still
more likely to use bitcoin. However, financial 
literacy encompasses several other areas of 
knowledge that are not captured by these con- 
trols. An alternative hypothesis is that young, 
educated men have a higher propensity to 
adopt new technologies in general. The litera- 
ture on payment methods has documented 
that young individuals have a greater propen- 
sity to adopt means of payment beyond cash, 
such as cards (87). Nevertheless, further re- 
search is necessary to causally identify the 
factors contributing to the observed hetero- 
geneity across demographic groups. An anal- 
ysis relying on all blockchain transaction-level 
data from Chivo Wallet allowed us to validate 
and better understand our survey results and 
is a unique opportunity to provide new in- 
sights on the dynamics of Chivo Wallet’s ac- 
tivity. The latter is valuable because the app 
is a unique exchange in that it can also be used 
as means of payment by law. 

Furthermore, the results carry policy impli- 
cations for other countries. A study of this 
experience is informative in drawing broader 
lessons on the likelihood of success of CBDCs 
and cryptocurrencies in contexts outside of El
Salvador. Assuming that the implementation 
of a digital wallet is similar in other contexts, 
our estimates allow us to explore what the
adoption of the technology would be in other
countries, which can prove valuable to policy 
makers. Two interesting cases are the CAF, 
which recently made bitcoin legal tender and 
has a stable local currency, as well as Panama, 
which also enacted a Crypto Law and where 
the USD is the official currency, as in El 
Salvador. El Salvador falls in between these 
countries in terms of both income per capita
and access to banking services (79). The in- 
troduction of a cryptocurrency could lead to 
outcomes different from the ones we docu- 
mented in countries where the local currency 
is unstable and there are restrictions on capi- 
tal mobility, such as Argentina and Turkey. 
Thus, an analysis of these contexts may offer 
fertile ground for future research to explore. 


Overall, we conclude that despite bitcoin’s
legal tender status and the large incentives to
promote Chivo Wallet, the cryptocurrency is 
not adopted at large by the population as a 
medium of exchange and digital payments are 
scarce and concentrated. These findings are 
informative about the intrinsic value of crypto- 
currencies as means of payments, as viewed 
in the larger context of monetary models in 
economics and about the scope of CBDCs in 
developing countries. 
Users can withdraw USD from their wallet either by doing a
transfer from their bank account or by withdrawing cash from
a Chivo Wallet ATM without a fee. As of September 2021, there
were 200 Chivo Wallet ATMs in El Salvador (see figs. S5 and
S6), and 51 in the US. Similarly, users can load money into
their wallets through an official website using a credit or debit
card or with cash through Chivo Wallet ATMs. Although funds
remain in Chivo Wallet, they represent a claim to either USD
or bitcoins, which is not uncommon in payment platforms.

In other words, both USD and bitcoins are a parallel digital
asset with a fixed exchange rate. In Chivo Wallet, the price of 
bitcoin is adjusted in real time to its market price. For instance, 
a customer could pay a firm or another user the USD price 
of an item in bitcoins, and the app would use the real-time 
exchange rate to charge her. 

The Lightning Network is a protocol that uses temporary 
payment channels operating off-chain. After a channel is
closed, payments are validated on the blockchain. 

World Bank, “GDP per capita (current US$)" (2020); 
https://data.worldbank.org/indicator/NY.GDP.PCAP.CD. 

Major gas stations dropped the gallon price by $0.20 for 
customers who paid with Chivo Wallet between September 
and October, and another drop of $0.30 per gallon was 
announced in November. 
For instance, during the 2001-2002 Argentinean crisis, several 
provinces introduced low-denomination bonds (“quasi
money”) and used them to pay wages and other inputs (86). It 
is worth noting that El Salvador is relatively small and is 
therefore a bitcoin price-taker; indeed, fig. S3 shows there 
were no large changes in the (global) price of bitcoin after the 
Bitcoin Law or Chivo Wallet’s launching. Thus, the experiment 
speaks about whether bitcoin is used as a means of payment 
given the above-described incentives despite the fact that it 
has a given resale value. 

Panama and CAF are benchmarked against regions in El
Salvador in Fig. 1B. 

Bill SB 1341 was introduced by state Sen. Wendy Rogers. 
Most information comes from the president's Twitter account. 
We tried, unsuccessfully, to contact multiple government
entities, including Chivo Wallet customer service, El Salvador’s 
Superintendency of the Financial System, Central Bank, and 
Casa Presidencial, to receive more quantitative information. 
In terms of the timing of the survey, information was collected 
across several weeks always including weekends; weekends 
were important in reaching a representative sample of profiles. 
CID-Gallup has been conducting surveys in Latin America for 
>40 years. It has an office in El Salvador that periodically 
conducts large-scale surveys. 
Total population shares match the General Directorate of 
Statistics and Censuses’ 2021 projections. 
Approximate survey length was 27 min. To obtain candid 
responses, respondents were guaranteed confidentiality and 
notified that the survey aimed to inform academic research. 
Table 1 relies on a linear probability model. Results are robust 
to other specifications; in particular, columns (1) and (3) of 
Table S12 show the marginal effects under a logit model. 
According to Chivo Wallet’s regulations, users must spend 
their bonus in bitcoins to incentivize its usage. Some people 
found ways to circumvent this restriction; for instance, sending 
the bonus to a family member and asking her to withdraw 
the money from a Chivo Wallet ATM. 

These findings regarding the prominence of adoption among young people are 
consistent with (87). 

Table S6 shows no evidence of technical issues with the app 
being a concern by constructing a dummy equal to one if the 
user faced problems using the app. 
Figs. S5 and S6 show Chivo Wallet ATM locations. Fig. S12 
displays mean distances to a Chivo Wallet ATM across 
population shares. 
In general, extending income-adoption relations requires 
caution, as countries with higher income, such as Panama, 
may have higher adoption of digital payments (e.g., card or 
mobile) and, thus, lower incentives to adopt a Chivo Wallet 
type of service (88). However, the adoption of digital payments 
in Panama was similar to that in El Salvador; in Panama, 
3.3% of people over 15 years of age report having borrowed 
from a financial institution or used a credit card, whereas 
1.5% is the corresponding percentage in El Salvador. 
Moreover, in both countries, 6.5% of people over 15 years of 
age have made a payment using their mobile phone or 
internet according to the World Bank's G20 Financial 
Inclusion Indicators. 
Fig. S2 reports official monthly data on remittances in bitcoins, 
and fig. S17 summarizes our results. 
Mistrust is also the main reason not to agree with the use of 
Chivo Wallet (fig. S16). 

Note that, in the US, apps to trade bitcoin are required to 
gather information on the identity of the trader, so bitcoin is 
not associated with anonymity in the US, just as in 
El Salvador's case. 

D. G. Baur, K. Hong, A. D. Lee, Bitcoin: Medium of exchange or 
speculative assets? J. Int. Financ. Mark. Inst. Money 54, 
177-189 (2018). doi: 10.1016/j.intfin.2017.12.004 

J. Bagnall et al., “Consumer cash usage: A cross-country 
comparison with payment diary survey data” (Tech. Rep., 
Social Science Research Network Electronic Journal, 2014); 
http://dx.doi.org/10.2139/ssrn.2436365. 

Source: “Chivo Wallet registra un promedio de 6,000 transacciones 
por día, según experto argentino,” Diario El Mundo, 
November 2021. Estimate obtained based on an adult 
population of 4.3 million. 

F. Alvarez, D. Argente, F. Lippi, E. Méndez, D. Van Patten, 
“Strategic complementarities in a dynamic model of 
technology adoption: P2P digital payments” (Working Paper 
31280, National Bureau of Economic Research, 2023); 
https://doi.org/10.3386/w31280. 

The diffusion of many technologies is also shaped by learning; 
this mechanism, however, does not necessarily create an 
externality or room for policy interventions to improve outcomes. 
More details are reported in fig. S21. 

Businesses that refuse to accept bitcoin are operating in 
violation of local regulations and are exposed to sanctions 
under the Consumer Protection Law; our survey suggests that 
enforcement of firm adoption is imperfect. 

The share that accepts cards is only a little over 25%. 

Even among firms that accept bitcoin, prices were quoted in USD 
and the Chivo Wallet app provided real-time bitcoin equivalents. 
First, a survey run by the Salvadoran Foundation for 
Economic and Social Development (FUSADES) toward the 
end of 2021 indicates that 10% of businesses have made 
sales in bitcoin (“Institutional Position N.106,” FUSADES, 
December 2021). Second, the Chamber of Commerce and 
Industry of El Salvador (Camarasal) conducted a survey in 
February 2022 reporting that 13.9% of businesses have 
made sales in bitcoin (“First Business Survey 2022,” Camarasal, 
March 2022). 

The Chamber of Commerce and Industry of El Salvador reports 
a similar estimate of firms that have not changed their sales 
(91.7%), and the Salvadoran Foundation for Economic and 
Social Development estimates that the share of sales paid in 
bitcoin is between 1 and 5%. 

Table S9 shows results robust to controlling for the sector of 
the firm. Findings are very similar if only including responses 
from the firm's owner or from an employee who reports 
working in sales. 

Fig. S22 shows (i) a summary of the results on prices from 
the consumer's perspective (21% have encountered higher 
prices at some businesses) and (ii) the full distribution 
of shares of sales in bitcoin across firms. Fig. S23 summarizes 
findings on firms. 

Although it can be verified faster, this extra speed incurs an 
additional cost. 
Although one entity can own several addresses, these are not 
transactions between Chivo Wallets owned by individuals. 
Thus, this figure considers transactions that involve an address 
that can be identified as Chivo Wallet and another address. 
The fees paid for these deposits tended to be higher closer to 
Chivo Wallet's launch (see fig. S26), which would be consistent 
with more urgency from bitcoiners trying to pay for goods 
and services when Chivo Wallet’s hype was at its peak. 
Throughout the period, fees for deposits into Chivo Wallet 
tended to be higher than those paid for withdrawals, which 
also points to more urgency on the deposits’ front compared 
with withdrawals. The data indicate that Chivo Wallet mostly 
transacted with well-known exchanges, the main one being 
Binance (12% of all the volume transacted), followed by Bitso, 
OKX, and Coinbase. 
Flows from the blockchain data have a standard deviation of 
84,300. To calculate these flows using our survey, we focused 
on inflows of bitcoin into Chivo Wallet from other wallets, 
because these are the transactions recorded on the blockchain. 
Thus, our population of interest consists of individuals 
who have deposited bitcoin into Chivo Wallet and have 
transferred bitcoins to wallets other than Chivo Wallet, ~2% of 
the adult population of El Salvador. For this sample, we computed 
total deposits per day as the difference between the total amount 
sent per day and the total amount received per day in the app, 
including transactions in both USD and bitcoins, because 
convertibility across currencies is free within the app. To estimate 
the total deposits in bitcoins per day, we multiplied total 
deposits times the share of deposits in bitcoins (17.3%). 
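
The survey-based flow estimate described in this note can be sketched in a few lines of Python. This is only an illustration, assuming the per-day survey totals are already aggregated; the function names and any example amounts are hypothetical, and only the 17.3% share of deposits in bitcoin comes from the text.

```python
# Minimal sketch of the survey-based flow estimate described in the note.
# Function names and example amounts are hypothetical; only the 17.3% share
# of deposits made in bitcoin is taken from the text.

BTC_SHARE_OF_DEPOSITS = 0.173


def total_deposits_per_day(sent_per_day: float, received_per_day: float) -> float:
    """Total deposits per day: amount sent minus amount received in the app."""
    return sent_per_day - received_per_day


def bitcoin_deposits_per_day(
    sent_per_day: float,
    received_per_day: float,
    btc_share: float = BTC_SHARE_OF_DEPOSITS,
) -> float:
    """Scale total deposits by the share of deposits made in bitcoin."""
    return total_deposits_per_day(sent_per_day, received_per_day) * btc_share
```

Amounts in USD and bitcoin are pooled before scaling because the note treats conversion within the app as free.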

The CAF has an income per capita of ~$418 USD and Panama 
of approximately $12,172 USD, and as in El Salvador, the 
alternative to bitcoin is a stable currency. Approximately 13.7% 
of the population in the CAF has access to a bank account, 
whereas in Panama this number is ~46.5% (Fig. 1B). 

INTRODUCTION: During Drosophila aversive olfactory conditioning, 
flies are exposed simultaneously to an odor and aversive electrical 
shocks. Flies learn to associate the odor with the aversive stimulus 
and subsequently avoid the odor. Learning and memory of this 
association require a brain region known as the mushroom bodies 
(MBs). Odor information is conveyed to the MBs by cholinergic 
projection neurons, which originate in the antennal lobes and synapse 
onto the dendrites of MB neurons. However, the cells and pathways 
that convey aversive shock information to the MBs have been 
uncertain. Activation of dopaminergic (DA) pathways that innervate 
the axons of MB neurons can bypass the requirement for aversive 
stimulation during conditioning, suggesting that DA may convey this 
information. On the other hand, dopamine is required for memory 
reinforcement and synaptic plasticity, complex functions that should 
occur after the transmission of aversive shock information to the 
MBs. Recent ex vivo imaging studies have suggested that glutamate 
(Glu) binding to N-methyl-D-aspartate (NMDA)-type Glu receptors 
(NMDARs) on MB neurons conveys shock information to the MBs, while 
DA functions subsequently to induce plasticity and reinforce 
associative memories. 


RATIONALE: NMDARs are expressed abundant- 
ly on the MBs and are required for forming 
associative memories. However, presynaptic 
Glu terminals—identified by expression of the 
vesicular Glu transporter (VGLUT), which 
transports Glu into synaptic vesicles—are 
extremely sparse in the MBs. This suggested 
that unidentified Glu terminals expressing an 
uncharacterized VGLUT may play a role in the 
transmission of shock information to the MBs 
and subsequent memory formation. 


RESULTS: We identified a previously unknown DVGLUT, DVGLUT2, through 
sequence homology and verified that it functions as a Glu 
transporter. DVGLUT2 is not expressed in neurons but instead is 
associated with vesicle-like structures in ensheathing glia (EG), 
which surround neuropil structures, including the MBs. Aversive 
electrical stimulation induces rapid Ca²⁺ transients in EG, 
vesicular exocytosis, and Glu release. Although EG surround all 
lobes of the MBs similarly, Glu release only occurs in select MB 
compartments. Other aversive stimuli, including bitter taste and 
noxious heat stimuli, also induce Glu release from EG in similar 
patterns. Blocking exocytosis from EG during conditioning impairs 
aversive learning, whereas artificial activation of EG can replace 
aversive stimuli during conditioning. Glu released from EG binds to 
NMDARs on MB neurons and kainate receptors (KARs) on DA neurons. 
Because of the Mg²⁺ block mechanism of NMDARs, Glu binding on its 
own is unable to induce Ca²⁺ influx into MB neurons. However, 
coincidental activation of MB neurons by both odors and aversive 
stimuli during aversive conditioning is sufficient to remove Mg²⁺ 
block and induce NMDAR-mediated Ca²⁺ influx, resulting in 
synergistic activation of MB neurons by odors and shocks. Glu binding 
to KARs is sufficient to induce Ca²⁺ influx into DA neurons and DA 
release onto MB neurons, but this release is not required for 
aversive learning after normal odor-shock paired conditioning. 
Instead, KAR-dependent DA release is required to prevent aberrant 
learning when shock onset precedes odor onset during backward 
conditioning. 

Ensheathing glia transmit aversive information. Aversive stimuli 
induce vesicular exocytosis from EG and release of Glu onto KARs on 
dopaminergic neurons (DANs) and NMDARs on MB neurons. (Left) When 
aversive stimuli are unpaired, Mg²⁺ block prevents Ca²⁺ influx. 
(Right) When paired with odors, odor-dependent depolarization removes 
Mg²⁺ block, allowing Ca²⁺ influx into appropriate MB neurons. 


CONCLUSION: Our results are consistent with 
a model in which shock information transmit- 
ted from EG bifurcates into two pathways. 
One pathway consists of Glu release onto 
NMDARs on MB neurons, and the second 
pathway induces the release of DA from DA 
neurons upon aversive but not neutral or ap- 
petitive stimulation. Glu released onto MB 
neurons works synergistically with odor sig- 
nals and is required for learning, whereas Glu 
released onto DA neurons prevents inap- 
propriate learning in situations in which an 
odor cannot be used to predict shocks. Overall, 
our data suggest that vesicle release of Glu 
from EG serves to transmit aversive informa- 
tion to higher-order brain areas and also func- 
tions to separate aversive sensory information 
from reinforcement signals during the formation 
of associative memories. 


During Drosophila aversive olfactory conditioning, aversive shock information needs to be transmitted 
to the mushroom bodies (MBs) to associate with odor information. We report that aversive information 
is transmitted by ensheathing glia (EG) that surround the MBs. Shock induces vesicular exocytosis 
of glutamate from EG. Blocking exocytosis impairs aversive learning, whereas activation of EG can 
replace aversive stimuli during conditioning. Glutamate released from EG binds to N-methyl-D-aspartate 
receptors in the MBs, but because of Mg²⁺ block, Ca²⁺ influx occurs only when flies are simultaneously 
exposed to an odor. Vesicular exocytosis from EG also induces shock-associated dopamine release, 
which plays a role in preventing formation of inappropriate associations. These results demonstrate 
that vesicular glutamate released from EG transmits negative valence information required for 
associative learning. 


In Drosophila aversive olfactory conditioning, flies are 
simultaneously exposed to an odor and aversive electrical shocks. 
Flies learn to associate the odor [the conditioned stimulus (CS)] 
with the shocks [the unconditioned stimulus (US)] and subsequently 
avoid the odor. This association is formed and stored in the 
mushroom bodies (MBs). Odor information is delivered to the MBs from 
the antennal lobes by cholinergic projection neurons that synapse 
onto MB neuron dendrites located in the calyces. However, the cells 
and transmitters that deliver US information to the MBs have been 
unclear. Prevailing models propose that US information is 
transmitted by dopaminergic neurons (DANs) (1-3), but recent ex vivo 
imaging studies have suggested that US information may be 
transmitted to N-methyl-D-aspartate receptors (NMDARs) in the MBs by 
glutamatergic (Glu) neurons (4, 5). However, Drosophila vesicular 
glutamate transporter (DVGLUT) staining is largely absent in the 
MBs, even though NMDARs are abundantly expressed in the MBs (6-9). 
This suggested that uncharacterized glutamatergic cells expressing a 
previously unidentified DVGLUT may transmit aversive information to 
the MBs. 


Results 
DVGLUT2 is expressed in ensheathing glia 


We performed BLAST (Basic Local Alignment Search Tool) searches and 
identified CG4288 on the basis of its homology to mammalian VGLUTs. 
CG4288 had been previously reported as a potential DVGLUT (10), and 
its amino acid sequence is highly homologous to sequences of mouse 
VGLUT1-3 (figs. S1 and S2A). Notably, CG4288 possesses 12 
transmembrane domains similar to those of other VGLUTs and conserved 
arginine (R) and glutamate (E) residues in transmembrane domain 4 
that are required for Glu transport (11). To determine whether CG4288 
functions as a vesicular Glu transporter, we inserted recombinant 
protein into artificial lipid membrane vesicles and measured the 
incorporation of ³H-labeled Glu in these vesicles (12). [³H]-Glu was 
incorporated into CG4288-containing proteoliposomes in the presence 
of valinomycin, a K⁺ ionophore that generates an electrochemical 
gradient required for VGLUT activity. This incorporation was 
suppressed by the VGLUT blocker Evans Blue or by the addition of 
excess unlabeled Glu (fig. S2B), verifying that CG4288 encodes a 
previously unidentified DVGLUT. Thus, we designated the previously 
characterized DVGLUT encoded by CG9887 as DVGLUT1, and our newly 
characterized DVGLUT encoded by CG4288 as DVGLUT2. 

To determine where DVGLUT2 is located in the brain, we generated an 
antibody to DVGLUT2 (fig. S3A) and performed immunofluorescence 
analyses. We did not find DVGLUT2 in the MB lobes (Fig. 1A). Instead, 
we found it in structures reminiscent of ensheathing glia (EG), which 
surround neuropil structures including the MBs and the central 
complex. EG can be identified by using an EG GAL4 driver, NP6520 (13, 
14), and a UASga1-GFP reporter. GFP (green fluorescent protein) 
expression cleanly overlapped with the DVGLUT2 signal (Fig. 1B and 
fig. S3B). GFP expression from NP6520 was not affected when we 
suppressed GAL4 activity in neurons using pan-neuronal Elav-GAL80, 
whereas expression was abolished when we suppressed GAL4 activity in 
glia using pan-glial Repo-GAL80 (Fig. 1C). Expression of DVGLUT2 
transcripts significantly decreased when two different DVGLUT2-RNA 
interference (RNAi) transgenes (330461 and 104145) were expressed from 
a glial Repo-GAL4 driver but not from an Elav-GAL4 driver (fig. S3C), 
further supporting the idea that DVGLUT2 is expressed specifically in 
glia. As previously reported (14), EG extend processes into MB lobes, 
where they are seen to divide MB compartments (Fig. 1C, arrowheads). 

Because DVGLUT2 has the ability to transport Glu into vesicles, we 
next examined whether we could identify synaptic vesicle (SV)-like 
structures in EG using electron microscopy. EG can be distinguished 
from MB structures by their higher electron densities (15). 
Consistent with immunofluorescence analyses, we found that EG 
processes invade MB lobes (Fig. 1D and fig. S3D). We further detected 
SV-like structures both in MB lobes and in EG structures close to MB 
lobes. Using electron microscopy and immunogold labeling, we 
confirmed that DVGLUT2 is associated with SV-like structures in EG 
(Fig. 1E and fig. S3E). The diameters of SV-like structures detected 
in EG with electron microscopy and of DVGLUT2-positive vesicles 
labeled with immunogold were statistically indistinguishable from the 
diameter of SVs observed in MB neurons (fig. S3F). 


Vesicular exocytosis from EG is required for 
olfactory aversive conditioning 


To analyze the importance of DVGLUT2 in olfactory aversive 
conditioning, we used the TARGET (16) and Gene-Switch (17) systems to 
suppress DVGLUT2 in EG. Aversive learning was disrupted when DVGLUT2 
was suppressed in adult glia but not in adult neurons (Fig. 2A and 
fig. S4A). Glial cells in the adult brain include cortex glia, 
astrocyte-like glia, and EG (14, 18). Olfactory aversive conditioning 
was impaired when DVGLUT2 was suppressed in EG but not in cortex or 
astrocyte-like glia (Fig. 2B). To further determine whether vesicle 
exocytosis from EG is required for aversive learning, we expressed a 
temperature-sensitive dynamin mutant, shiᵗˢ (19), in EG (EG>shiᵗˢ). 
Dynamin is required for SV recycling, and expression of shiᵗˢ is used 
to abolish synaptic output from neurons at restrictive temperatures 
(19, 20). Aversive olfactory learning scores were significantly 
impaired in these flies when they were trained at restrictive 
temperature (30°C) and tested at permissive temperature (18°C), but 
not when they were trained at the permissive temperature and tested 
at the restrictive temperature (Fig. 2C and fig. S4, B and C). These 
results indicate that vesicle recycling in EG is required during 
aversive learning when electric shocks are used as the US. EG>shiᵗˢ 
at 30°C produced a larger learning impairment than suppressing 
DVGLUT2 (EG>DVGLUT2-RI). This suggests that either glial suppression 
of DVGLUT2 is less efficient than shiᵗˢ-dependent inhibition of 
vesicle cycling, or that other transmitters in addition to Glu may be 
released from EG vesicles to affect learning. 

Fig. 1. DVGLUT2 is found in EG. (A) Immunohistochemistry using 
antibody to DVGLUT2 (αDVGLUT2). The MB lobes were visualized by 
expressing mCherry from the MB-LexA driver. The α and α′ lobes of the 
MBs are labeled. (B) DVGLUT2 localizes to EG. GFP was expressed from 
the NP6520 driver. The MB lobes were labeled using anti-Fas II 
antibodies. (C) NP6520 does not drive expression in neurons. (Left) 
GFP expression from the NP6520 driver was not affected by Elav-GAL80, 
whereas (right) it was abolished by Repo-GAL80. The MB lobes were 
labeled by means of dsRed or anti-PKA antibodies. GFP expressed from 
NP6520 seems to delineate MB compartments (arrowheads). Scale bars, 
(A) to (C) 10 μm. (D) Electron micrograph of EG bordering the MB. EG 
are colored magenta. Magnified images of the regions (a and b) 
encompassed by the white rectangles at left are shown at right. 
(Insets) Further magnified images of the regions encompassed by white 
rectangles at right. Vesicle-like structures in EG are indicated by 
arrowheads in (a), and SVs in neuronal synapses are indicated by 
arrowheads in (b). (E) Immunoelectron micrograph of EG. 
Immunogold-labeled αDVGLUT2 is found on the vesicle-like structures. 
Regions encompassed by white squares are magnified in the insets. 

Denatonium is a bitter compound that can also be used as an aversive 
stimulus during aversive memory conditioning (21). Aversive memory 
formation by using denatonium as the US was also severely impaired in 
EG>shiᵗˢ flies at restrictive temperature, whereas appetitive memory 
formation by using sucrose as the reward was unaffected (Fig. 2, D 
and E), indicating that EG transmit aversive, but not appetitive, 
information to the MBs. 


Aversive stimuli induce vesicle exocytosis and 
Glu release from EG 

To determine whether aversive stimuli induce vesicle exocytosis from 
EG onto MB lobes, we performed syb-GRASP (GFP reconstitution across 
synaptic partners) experiments (22) in which functional GFP is only 
reconstituted upon vesicular release from EG at the MBs. We generated 
transgenic flies expressing synaptobrevin fused to GFP exons 1 to 10 
(syb-spGFP1-10) in EG and its GRASP partner, mouse CD8, fused to GFP 
exon 11 (mCD8-spGFP11) in the MBs. In these flies, syb-spGFP1-10 is 
transported to the cell surface of EG upon vesicle fusion, and 
functional GFP is reconstructed when vesicle fusion occurs in the 
vicinity of mCD8-spGFP11 expressed in the MBs. No GFP signals were 
observed in these flies in the naive state or after odor presentation 
alone. However, signals were observed after shock alone or after 
simultaneous shock-odor presentation during aversive olfactory 
conditioning (Fig. 3A). Although EG surround all MB lobes, GFP 
signals were not observed in all lobes homogeneously but instead 
observed specifically in the peduncle, γ1, γ5, α2, β1, β′1, and β′2 
compartments of the MB lobes (Fig. 3, B and C). We observed similar 
but not identical GFP signal patterns when the transgenic flies were 
exposed to denatonium or to noxious heat but not when they were fed 
sugar rewards (fig. S4D). 

To detect vesicle exocytosis from EG in real time, we generated 
transgenic flies that express a DVGLUT2::pHluorin fusion protein in 
EG (Fig. 3D) and examined fluorescence changes in EG that contact the 
horizontal lobes of the MBs. Consistent with syb-GRASP results, we 
observed vesicular exocytosis from EG in γ1, γ5, β′1, and β′2 
compartments in response to shock presentation (Fig. 3E and fig. 
S5A). Because of the orientation of the fly and the angle of the 
brain in our in vivo setup, we could not observe the peduncles, α2, 
and β1 compartments, which also showed vesicular exocytosis in 
syb-GRASP experiments. 

Fig. 2. Vesicular exocytosis from EG is required for aversive memory 
formation. (A) Reducing expression of DVGLUT2 in glia but not neurons disrupts 
olfactory aversive conditioning. During conditioning, flies are exposed to an 
odor (the CS+ or shock-paired odor) for 1 min while simultaneously being 
electrically shocked every 5 s. Flies are next exposed to a second odor (the CS-, 
or unpaired odor) for 1 min in the absence of shocks. Memory acquisition is 
measured 3 min after conditioning by having the flies choose between the two 
odors. A performance index (PI) is calculated by subtracting the percentage of 
flies that choose the shock-paired odor from the percentage that choose the 
unpaired odor. A higher PI is correlated with greater memory. DVGLUT2 was 
suppressed in adult flies at the restrictive temperature (+HS). (B) DVGLUT2 is 
required acutely in EG during olfactory aversive conditioning. DVGLUT2 was 
suppressed acutely in EG, cortex glia (CG), and astrocyte-like glia (ALG). 
(C) Blocking vesicular exocytosis from EG during conditioning impairs aversive 
memory formation. (D) Blocking vesicular exocytosis from EG during conditioning 
impairs aversive memory formation with denatonium. (E) Blocking vesicular 
exocytosis from EG during conditioning does not affect appetitive memory formation 
with sucrose. ***P < 0.001. 
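
The performance index (PI) defined in the Fig. 2 legend can be written as a short calculation. The sketch below is illustrative only: the function name and the fly counts are hypothetical, and it assumes that percentages are taken over the flies that chose one of the two odors.

```python
# Illustrative sketch of the performance index (PI) from the Fig. 2 legend.
# The function name and counts are hypothetical; percentages are assumed to be
# computed over the flies that chose one of the two odors.


def performance_index(n_paired: int, n_unpaired: int) -> float:
    """PI = % of flies choosing the unpaired odor (CS-) minus
    % of flies choosing the shock-paired odor (CS+)."""
    total = n_paired + n_unpaired
    if total == 0:
        raise ValueError("no flies made a choice")
    pct_paired = 100.0 * n_paired / total
    pct_unpaired = 100.0 * n_unpaired / total
    return pct_unpaired - pct_paired


# Example: 20 flies choose the shock-paired odor and 80 the unpaired odor.
print(performance_index(20, 80))  # 60.0
```

Under this definition, a PI of 0 means the flies split evenly between the two odors, and a PI of 100 means every fly avoided the shock-paired odor.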

Next, to determine whether vesicular release is associated with 
increases in EG intracellular Ca²⁺, we used transgenic flies 
expressing syt-GCaMP, a fusion protein of synaptotagmin and GCaMP 
(23), in EG. Shock, but not odor, presentation induced Ca²⁺ 
responses in EG surrounding MB lobes in a pattern similar to what we 
observed for vesicular exocytosis (Fig. 3F and fig. S5B). 
Furthermore, denatonium treatment, but not sugar treatment, induced 
Ca²⁺ responses in EG (fig. S4E). When we next measured Glu release 
from EG by expressing the Glu sensor iGluSnFR(A184V) (24) in EG, we 
again found a pattern of increase in Glu signals from EG that 
paralleled Ca²⁺ and vesicular exocytosis signals (Fig. 3G and fig. 
S5C). 

Fig. 3. Electrical shocks induce vesicle exocytosis from EG. (A) 
GRASP signals (reconstruction of GFP from component parts) in naive, 
odor-stimulated, shock-stimulated, and odor and shock-stimulated 
EG>syb-spGFP1-10; MB>mCD8-spGFP11 flies. (B) Magnified GRASP signals. 
Arrowheads indicate GFP signals at junctures between compartments. 
(C) Schematic diagram showing the location of shock-induced GRASP 
signals. (D) Schematic diagram of vesicle exocytosis monitored by 
means of DVGLUT2::pHluorin. (E) Vesicular exocytosis from EG in the 
MB γ1 compartment in response to electric shocks. Triangles indicate 
electrical shocks in this and subsequent figures. All functional 
imaging was performed at 20 mM Mg²⁺ concentrations in this study 
unless otherwise mentioned. (F) Ca²⁺ responses in EG bordering the γ1 
compartment during odor and shock presentation. The light blue band 
in Figs. 3 to 6 indicates odor (3-octanol) presentation. Peak 
responses were determined for the period indicated by the black bars. 
3-Octanol was used as the odor in this study unless otherwise 
mentioned. (G) Glu release from EG in the γ1 compartment during odor 
and shock presentation. Scale bars, 10 μm. The time windows used to 
determine peak responses are shown as black bars. ***P < 0.001. 


EG transmit aversive signals to the MBs 


To next examine MB Ca²⁺ responses during electrical shock 
presentation, we expressed syt-GCaMP in the MBs. Although odor 
exposure evoked significant Ca²⁺ responses in the MBs, we did not 
observe Ca²⁺ responses in any MB compartments upon shock presentation 
(Fig. 4A and fig. S6A). We verified that Glu was released onto MB 
lobes in response to electrical shocks and denatonium but not to 
odors and sugar (Fig. 4B and fig. S6B). Patterns of Glu release onto 
MB compartments paralleled patterns of Glu release from EG upon 
electrical shocks. 

Why were Ca²⁺ responses not observed in the MBs even though Glu was 
released onto the MBs upon electrical shock? Aversive somatosensory 
information is conveyed by ascending fiber of the ventral nerve cord 
(AFV), and previous ex vivo imaging studies suggest that AFV-evoked MB 
responses are mediated by NMDARs (4, 5). Because of Mg²⁺ block, Glu 
binding on its own is unable to activate NMDARs at resting membrane 
potentials. Instead, activation requires both Glu binding and 
postsynaptic depolarization to release Mg²⁺ ions that block the 
channel pore (25, 26). The concentration of extracellular Mg²⁺ in the 
fly brain is around 20 mM (27), and Mg²⁺ concentrations higher than 
10 mM are sufficient to cause Mg²⁺ block (8). When we reduced the 
extracellular Mg²⁺ concentration to 2 mM, we found significant Ca²⁺ 
responses in the MBs upon electrical shocks, and these responses were 
suppressed by the NMDAR inhibitor MK801 (Fig. 4C and fig. S6C). To 
determine whether shock-evoked MB responses at low Mg²⁺ 
concentrations are caused by vesicular exocytosis from EG, we 
examined MB Ca²⁺ responses in transgenic flies expressing GCaMP in 
the MBs and shiᵗˢ in EG at 2 mM Mg²⁺ conditions (Fig. 4D). Whereas 
both shock- and odor-evoked Ca²⁺ responses were observed at 
permissive temperature (18°C), shock-evoked but not odor-evoked 
responses were abolished at restrictive temperature (32°C). Moreover, 
when DVGLUT2 was suppressed in EG, again shock-evoked but not 
odor-evoked Ca²⁺ responses were extinguished (Fig. 4). Although 
syb-GRASP data suggest that vesicular exocytosis from EG occurred 
predominantly at the outer border of γ1, γ4, γ5, β′1, and β′2 
compartments, Glu responses were observed throughout each of these 
compartments (fig. S7). 

Fig. 5. Odor inputs gate the transmission of shock information to 
the MBs. (A) Ca²⁺ responses in the γ1 compartment during odor, shock, 
and simultaneous odor and shock presentation. (B) The NMDAR blocker 
CPP abolishes shock-dependent Ca²⁺ responses in the MBs during 
simultaneous odor and shock presentation. (C) Ca²⁺ responses in the 
MBs in which observation windows were sealed with transparent glue in 
the absence of added buffers. (D) Artificial activation of EG can 
replace aversive stimuli during aversive conditioning. PI scores 
measure the flies’ preference for the CS- odor compared with the CS+ 
odor. ***P < 0.001. 


Integration of aversive information into the MBs 
is gated by coincident odor input 


We next examined whether depolarization of MB neurons by odor 
presentation is sufficient to remove Mg²⁺ block and permit the 
transmission of shock information to the MBs. When flies were exposed 
to an odor in the absence of electric shocks, we observed Ca²⁺ 
responses in the MBs. When flies were exposed to shocks in the 
absence of odors, we observed no MB Ca²⁺ responses. However, when we 
exposed flies to odor and shocks simultaneously, we observed a 
synergistic effect in which previously missing shock responses were 
observed in addition to odor responses (Fig. 5A and fig. S8A). 
Shock-associated Ca²⁺ influx during coincident odor-shock inputs was 
suppressed by the NMDAR blocker CPP 
(carboxypiperazin-4-yl-propyl-1-phosphonic acid) (Fig. 5B). 

Although we found that shock inputs in the absence of odor inputs 
are prevented from being transmitted to the MBs at 20 mM Mg²⁺ 
concentrations, other physiological and functional imaging studies 
used lower Mg²⁺ concentrations and did not observe this effect. Thus, 
to determine whether our observed gating mechanism occurs in the 
absence of externally added buffer, we performed experiments in which 
we opened the fly head capsule and filled it with transparent glue 
that was hardened by ultraviolet (UV) irradiation in the absence of 
added buffers (28, 29). Similar to our observations when we filled 
head capsules with buffer containing 20 mM Mg²⁺, we observed Ca²⁺ 
responses in the MB upon odor presentation but not upon shock 
presentation alone. We further observed shock-associated Ca²⁺ 
responses during simultaneous odor-shock presentation (Fig. 5C and 
fig. S8B), again indicating that shock information is only 
transmitted to MB neurons that have been depolarized by odor inputs. 

Because EG convey aversive information to 
the MBs, we examined whether artificial activation 
of EG can replace aversive stimuli during 
conditioning. We used transgenic EG>TRPA1 
flies expressing the temperature-sensitive cation 
channel dTRPA1 (30) in EG and observed 
increased Ca²⁺ responses when dTRPA1 channels 
were activated by temperature shifts to 32°C 
(fig. S9A). When we performed syb-GRASP 
experiments in these flies, similar to denatonium, 
heat, and electrical shock presentation, we 
observed reconstituted GFP signals in the peduncle 
and γ1 of the MB lobes at 32°C, indicating that 
vesicular exocytosis from EG to MB neurons 
occurs upon activation of dTRPA1 in EG (fig. 
S9B). After exposing these flies to two odors, 
the first (CS+) at 32°C to activate dTRPA1 and 
the second (CS-) at 18°C, at which TRPA1 
channels are inactive, we observed that flies 
displayed a subsequent strong avoidance of the 
CS+ odor (Fig. 5D), demonstrating that artificial 
activation of EG can bypass the requirement 
for aversive stimulation during aversive 
memory formation. 


EG also transmit aversive signals to DANs 


Electrical shocks induce Ca²⁺ influx in the 
axon terminals of DANs and dopamine (DA) 
release onto MBs (31-34). However, the neural 
mechanism that causes shock-associated DA 
release remains unclear. To examine whether 
vesicular exocytosis from EG also transmits 
aversive information to DANs to regulate DA 
release, we used transgenic flies expressing 
shi^ts in EG and a DA sensor in the MBs and 
measured DA release onto γ1. At the permissive 
temperature, we observed DA release 
onto γ1 after both shock and odor presentation. 
At the restrictive temperature, shock-associated 
DA release was abolished, whereas 
odor-associated DA release was unaffected, 
indicating that vesicular release from EG is 
required specifically for shock-dependent DA 
release onto γ1 (Fig. 6A). We also observed 
Ca²⁺ responses in DAN axon terminals in the 
γ1 compartment in response to odor presentation 
and shock presentation. Again, shock-associated 
responses were abolished at the 
restrictive temperature in EG>shi^ts flies, 
demonstrating that both Ca²⁺ responses and DA 
release from DANs depend on EG signaling 
(Fig. 6B). Further, shock-associated but not 
odor-associated Ca²⁺ responses were extinguished 
by CNQX (6-cyano-7-nitroquinoxaline-2,3-dione), 
an inhibitor of AMPA and kainate-type Glu 
receptors (Fig. 6C). Consistent with this, when 
we suppressed the expression of kainate receptors 
in the DANs, Ca²⁺ responses associated 
with shock presentation but not odor presentation 
were significantly suppressed (Fig. 6D). To 
examine whether DA activity may also be required 
for shock-associated EG responses, we 
expressed a shi^ts transgene in DA neurons but 
did not observe any differences in shock-associated 
Ca²⁺ responses in EG at restrictive 
or permissive temperatures (Fig. 6E). Thus, EG 
function upstream of both MBs and DANs 
during transmission of aversive information. 
Glu released from EG binds to and activates 
kainate receptors on DANs, whereas an independent 
mechanism activates DANs during 
odor exposure. 

We next examined whether EG-mediated 
DA release is involved in associative learning. 
Unexpectedly, when we suppressed kainate 
receptors in DANs, we found that aversive 
learning (3-min memory) and long-lasting 
7-hour memory were normal (Fig. 6F). In con- 
trast to conventional forward conditioning, 
backward conditioning—in which the onset 
of the US occurs before CS exposure—does 
not normally form associative memory. We 
found that suppression of kainate receptors in 
DANs caused flies to form significant 3-min 
memory after backward conditioning (Fig. 
6G). These results suggest that direct trans- 
mission of aversive information to DANs 
through kainate receptors is not required for 
normal memory but instead functions to pre- 
vent formation of inappropriate associations 
in situations in which the CS does not serve 
as a predictor of the US. 


Discussion 


Glia are known to regulate neural and synap- 
tic plasticity and play a critical role in reg- 
ulating behavioral changes (35-39). Our data 
indicate that glia can further function to di- 
rectly transmit aversive sensory information 
to higher-order brain areas and separate aver- 
sive sensory information from reinforcement 
signals during the formation of associative 
memories. Previously, DA had been proposed 
to both transmit aversive information to the MBs 
and reinforce associations (33, 40-43). However, 
our data indicate that EG Glu pathways that 
mediate transmission of aversive information 
are distinct and function upstream of DA path- 
ways that reinforce associations (32, 44, 45). 
Aversive stimuli induce increases in Ca²⁺ 
concentrations, vesicular exocytosis, and release of 
glutamate from EG. Blocking vesicular cycling 
or suppressing glutamate transporter activity 
in EG disrupts memory, whereas artificial activ- 
ation of EG can replace aversive stimulation to 
induce associative memory formation. 

Because associations can be formed with 
many different odors, aversive Glu signals are 
likely to be sent to many MB axons. However, 
plasticity should only occur at select locations 
where coincident odor and aversive informa- 
tion converge (46). NMDARs have been pro- 
posed to function as coincidence detectors that 
regulate Hebbian plasticity because they require 
both depolarization to remove Mg²⁺ block 
and Glu binding to be activated (47, 48). Our 
results are consistent with this idea. Glutamate 
released from EG on its own is unable to 
induce Ca²⁺ influx into mushroom body neurons 
at physiological Mg²⁺ concentrations but is able 
to do so in low-Mg²⁺ conditions or when mushroom 
body neurons are coincidently activated 
by odors. Furthermore, suppressing NMDARs 
in the mushroom bodies impairs aversive learning 
(5). Altogether, our results indicate that 
odor-dependent depolarization removes Mg²⁺ block 
of NMDARs, which allows glutamate released 
by EG to activate NMDARs in appropriate neurons 
for formation of odor-specific memories. 
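
The gating logic described above can be written as a small truth function. This is only a toy sketch, not the authors' model: the function name, the boolean inputs, and the 20 mM cutoff used to decide whether Mg²⁺ block is in place are illustrative assumptions chosen to match the two concentrations reported in the text (20 mM and 2 mM). It covers only the NMDAR-dependent, shock-associated Ca²⁺ influx, not the odor-evoked response itself.

```python
def nmdar_ca_influx(glutamate_bound: bool, depolarized: bool, mg_mM: float,
                    block_threshold_mM: float = 20.0) -> bool:
    """Toy coincidence detector for NMDAR-mediated Ca2+ influx.

    Assumption (hypothetical): Mg2+ block is present whenever the bath
    concentration is at or above `block_threshold_mM` and the neuron is not
    depolarized. The real voltage and concentration dependence is graded.
    """
    mg_block = (mg_mM >= block_threshold_mM) and not depolarized
    # Influx needs Glu binding and the absence of Mg2+ block.
    return glutamate_bound and not mg_block


# Conditions loosely mirroring the experiments described in the text.
conditions = {
    "shock alone, 20 mM Mg":  nmdar_ca_influx(True, False, 20.0),
    "odor + shock, 20 mM Mg": nmdar_ca_influx(True, True, 20.0),
    "shock alone, 2 mM Mg":   nmdar_ca_influx(True, False, 2.0),
    "odor alone, 20 mM Mg":   nmdar_ca_influx(False, True, 20.0),
}
for label, influx in conditions.items():
    print(f"{label}: NMDAR-mediated influx = {influx}")
```

In this sketch, glutamate from EG corresponds to `glutamate_bound` and odor-evoked activity to `depolarized`; only their coincidence (or low Mg²⁺) opens the gate.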

Glutamate released by EG also induces Ca²⁺ 
activity and DA release through activation of 
kainate-type glutamate receptors on DANs. 
Unexpectedly, however, direct transmission 
of aversive information from EG to kainate 
receptors on DANs is not necessary for forming 
associations. Instead, our data suggest that 
transmission of aversive information to DANs 
prevents associations from forming in situa- 
tions in which the CS cannot be used to pre- 
dict the US. Thus, kainate receptor-dependent 
DA release in the absence of odors prevents 
the formation of inappropriate associations, 
whereas DA released upon coincident odor or 
shock stimulation of postsynaptic MB neurons 
through an independent mechanism reinforces 
associations. Previous studies have proposed 
mechanisms that allow postsynaptic mush- 
room body neurons to induce local DA release. 
Cervantes-Sandoval et al. (49) describe recip- 
rocal synapses formed between mushroom 
body neurons and DA neurons where DANs 
receive cholinergic input from MB neurons, 
and ex vivo studies suggest that coincidentally 
activated MB neurons generate carbon mon- 
oxide that functions as a retrograde messenger 
to induce local DA release from presynaptic 
terminals (45, 50). We hypothesize that both 
or either of these mechanisms may reinforce 
association in coincidentally activated MB neu- 
rons (fig. S10). 

Although EG cover the entire MB neuropil, 
aversive stimuli do not cause EG to release 
glutamate to all MB compartments equally. 
Instead, vesicular exocytosis and glutamate 
release occur prominently in the peduncle, 
γ1, and γ5 compartments with little to no release 
in compartments such as γ3. These data 
are consistent with models assigning different 
valences to different MB compartments (23, 57). 
In mammalian astrocytes, different activation 
pathways are known to produce distinct microdomain 
activity (52). Our current data support 
the idea that physiological and functionally 
relevant information is precisely arranged and 
processed in spatially different subclasses and 
subdomains of astrocytes (53-55). 

Fig. 6. EG regulates DA 


Methods summary 
Cloning DVGLUT2 cDNA 


DVGLUT2 (CG4288) was amplified by means 
of polymerase chain reaction (PCR) by using the 
following primers: 
5'-GGGGACAAGTTGTGTCAAAAGCAGCTTCCAAGCAGTTCGGATTCCAGT-3' 
and 
5'-GGGGACCACTTTGTACAAGAAAGCTGGGTCGACACAATGGTGCGCATATC-3', 
and cloned 
into pDONR221 (Thermo Fisher Scientific). 


Fly strains and generation of 
UAS-DVGLUT2::pHluorin flies 


A w Canton-S line, w(CS10), was used as our 
wild-type line, and all transgenic lines used in 
our experiments were outcrossed to w(CS10) 
at least six times (56). DVGLUT2::pHluorin 
was generated by inserting pHluorin into the 
first SV loop of DVGLUT2 as previously de- 
scribed for mammalian VGLUT2::pHluorin (57). 


Immunoblotting and immunohistochemistry 


Antibodies to DVGLUT2 peptide were gener- 
ated in rabbits against the C-terminal sequence 
of DVGLUT2, CQLTQTAGFVNSGAELKD, and 
affinity purified. Antibodies were used for 
immunoblotting at 2.8 µg/ml and for 
immunohistochemistry at 6.9 µg/ml. For synaptic-GRASP 
experiments, reconstituted GFP was detected 
by using antibody to GRASP (Sigma-Aldrich). 
For electrical shock presentation, flies were 
placed in the electrical grid chamber and re- 
ceived 1.5-s pulses of 60-V electrical shocks for 
1 min at 0.2 Hz. For aversive bitter taste pres- 
entation, flies were starved for 16 hours in vials 
containing Whatman paper saturated with 
water. Starved flies were then placed in vials 
containing Whatman paper saturated with 
10 mM denatonium for 1 min. For reward su- 
crose presentation, starved flies were placed in 
vials containing Whatman paper with 2 M 
sucrose for 2 min. For noxious heat presenta- 
tion, flies were placed in glass vials and im- 
mersed in a 42°C water bath for 1 min. 


Glu uptake assays 


The expression vector DVGLUT2/pET-28a(+) 
was constructed and transformed into Escherichia 
coli C43 (DE3) cells according to previously de- 
scribed protocols (12). Purified DVGLUT2 was 
reconstituted into proteoliposomes by using 
the freeze-thaw method, and L-Glu uptake 
was assayed using the gel-permeation pro- 
cedure (58). 


Electron microscopy 


Fly brains were fixed with 2% paraformaldehyde 
(PFA) in 0.1 M cacodylate buffer, pH 7.3 (CB), 
containing 2.5% glutaraldehyde and 15% sat- 
urated picric acid, postfixed with 2% osmium 
tetroxide in 0.1 M CB, and stained with 50% 
saturated uranyl acetate in 50% ethanol. Sam- 
ples were dehydrated, polymerized in epoxy 
resin (EPON 812, TAAB), sectioned into coronal 
serial semithin sections (4 µm thickness), 
and stained with toluidine blue. Sections were 
embedded on glass microscope slides and 
polymerized in epoxy resin. A total of 40 to 
60 ultrathin serial sections (50 to 80 nm thick- 
ness) were cut, stained with uranyl acetate and 
lead citrate, and observed under a transmis- 
sion electron microscope (JEM-1400, JEOL, 
and H-7650, HITACHI). 


Immunoelectron microscopy 


Brains were fixed with 4% PFA in 0.1 M phos- 
phate buffer containing 0.1% glutaraldehyde 
and 15% saturated picric acid, blocked with 
10% normal goat serum/0.05% NaN₃/PBST 
(phosphate-buffered saline with Tween 20), and 
incubated with rabbit antibody to DVGLUT2 
peptide at 2.8 µg/ml. Samples were then 
incubated with colloidal gold [1:100, EM goat 
anti-rabbit immunoglobulin G (IgG), 20 nm, 
BB International] and enhanced by using a 
silver enhance kit (AURION R-Gent SE-EM, 
Aurion). Post-fixations were conducted with 
1% osmium tetroxide in 0.1 M phosphate buffer, 
followed by dehydration in a graded series of 
ethanol and propylene oxide. Samples were 
embedded in epoxy resin (EPON 812, TAAB). 
Ultrathin sections were observed under a trans- 
mission electron microscope (JEM-1400, JEOL). 


Quantification of Transcripts 


Transcripts were quantified by means of real- 
time PCR (CFX96 C1000, Bio-Rad) as described 
previously (59). 


Behavioral assays 


Standard single-cycle aversive olfactory condi- 
tioning was performed as previously described 
(56, 60). Two odors [3-octanol (OCT) and 
4-methylcyclohexanol (MCH)] were used as 
CS. For conditioning with electrical shocks, 
the US of electrical shocks was paired with 
one of the odors and consisted of 1.5-s pulses 
of 60-V dc electric shocks. For conditioning 
with denatonium, flies were exposed to the 
CS+ odor in a training vial containing 10 mM 
denatonium for 1 min and then transferred to 
a training vial containing water, where they 
were exposed to the CS- odor for 1 min. To test 
for memory retention, flies were placed at the 
choice point of a T-maze in which they were 
exposed simultaneously to the CS+ (previously 
paired with the US) and CS- (unpaired with the 
US). A performance index (PI) was calculated by 
subtracting the percentage of flies that chose the 
CS+ from the percentage that chose the CS-. 
Peripheral control experiments, including odor 
acuity and shock reactivity assays, were per- 
formed as described previously (56, 60). 

For appetitive conditioning with sucrose, 
flies were first exposed to the CS- odor for 2 min 
and then transferred to a training vial con- 
taining 2 mM sucrose, where they were ex- 
posed to the CS+ odor for 2 min. Flies were 
tested as described above, and PIs were cal- 
culated by subtracting the percentage of flies 
that chose the CS- odor from the percentage 
that chose the CS+ odor. 
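
The two performance-index definitions above differ only in sign. A minimal sketch of the arithmetic follows; the function name and the `appetitive` flag are hypothetical, and it assumes percentages are taken over flies that chose one of the two T-maze arms, which the text does not state explicitly.

```python
def performance_index(n_cs_plus: int, n_cs_minus: int,
                      appetitive: bool = False) -> float:
    """Performance index (PI) from T-maze counts.

    Aversive:   PI = %(CS-) - %(CS+)   (avoidance of the paired odor)
    Appetitive: PI = %(CS+) - %(CS-)   (approach to the paired odor)
    Assumption: percentages use only flies that made a choice.
    """
    total = n_cs_plus + n_cs_minus
    if total == 0:
        raise ValueError("no flies made a choice")
    pct_plus = 100.0 * n_cs_plus / total
    pct_minus = 100.0 * n_cs_minus / total
    return pct_plus - pct_minus if appetitive else pct_minus - pct_plus


# Made-up counts: 20 flies in the CS+ arm, 80 in the CS- arm.
print(performance_index(20, 80))                    # aversive PI
print(performance_index(80, 20, appetitive=True))   # appetitive PI
```

With this convention a PI of 0 means no preference and a PI of 100 means every fly avoided (aversive) or approached (appetitive) the CS+ odor.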

For temperature shift experiments, flies were 
transferred to a room at 30°C 5 min before 
providing sensory stimuli, loaded into a pre- 
heated training chamber, and exposed to the 
CS+ odor for 1 min. Flies were then imme- 
diately transferred to a 20°C room, loaded into 
a 20°C training chamber, and exposed to the 
CS- odor for 1 min. 


Functional imaging 


Flies were immobilized in an imaging cham- 
ber, and the parietal cuticle was removed with a 
syringe needle to expose the brain. A recording 
solution [70 mM NaCl, 5 mM KCl, 10 mM 
NaHCO₃, 5 mM Hepes (pH 7.1), 5 mM trehalose, 
115 mM sucrose, 1.5 mM CaCl₂, and 20 mM 
MgCl₂] was used to cover the brain. For the 
low-Mg²⁺ recording solution, 2 mM MgCl₂ was used 
instead of 20 mM. For functional imaging in 
the absence of added buffer, exposed brains 
were immediately covered with UV Curing Optical 
Adhesives (NOA68, Thorlabs) as described 
previously (28, 29). For olfactory aversive 
conditioning, 60-V, 100-ms pulses of electric shock 
were applied to flies through a 50-kΩ resistor 
every 3 s while CS+ odor was pumped through 
the imaging chamber at 200 ml/min for 30 s. 

Fluorescent images were captured at 4 Hz 
by use of a confocal microscope system. F0 was 
calculated as the mean fluorescence intensity 
during the 10 s immediately before stimulation, 
and ΔF was defined as the difference 
between F0 and the F value at each time point. 
ΔF/F0 for DVGLUT2::pHluorin (vesicular 
exocytosis) and DA1m (DA responses) were measured 
similarly to measurements of ΔF/F0 
for GCaMP (Ca²⁺ responses). To quantify 
stimulation-induced fluorescent changes, ΔF/F0 
in the absence of stimulation was subtracted 
from ΔF/F0 at indicated time points during 
and after stimulation. MK-801 (Tocris), CPP 
(Tocris), and CNQX (Tocris) were used at 
concentrations of 100 µM, 1 mM, and 200 µM, 
respectively. 
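
The normalization described above can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' analysis code: it takes ΔF as F minus F0 at each frame, uses the 4-Hz frame rate and 10-s pre-stimulus baseline given in the text, and the function names and example trace are hypothetical.

```python
FRAME_RATE_HZ = 4   # acquisition rate given in the text
BASELINE_S = 10     # pre-stimulus window used for F0


def delta_f_over_f0(trace, stim_frame,
                    frame_rate_hz=FRAME_RATE_HZ, baseline_s=BASELINE_S):
    """Return dF/F0 for every frame of a fluorescence trace.

    F0 is the mean of the frames in the `baseline_s` seconds immediately
    before `stim_frame`; dF is assumed to be F - F0 at each frame.
    """
    n_base = int(frame_rate_hz * baseline_s)
    if stim_frame < n_base:
        raise ValueError("not enough pre-stimulus frames for the baseline")
    baseline = trace[stim_frame - n_base:stim_frame]
    f0 = sum(baseline) / len(baseline)
    if f0 == 0:
        raise ValueError("baseline fluorescence is zero")
    return [(f - f0) / f0 for f in trace]


def stimulus_evoked(dff_stim, dff_no_stim):
    """Subtract the no-stimulation dF/F0 from the stimulated dF/F0, frame by frame."""
    return [a - b for a, b in zip(dff_stim, dff_no_stim)]


# Made-up trace: 10 s of baseline at 100 a.u., then 2 s at 150 a.u.
trace = [100.0] * 40 + [150.0] * 8
dff = delta_f_over_f0(trace, stim_frame=40)
print(dff[39], dff[40])
```

The second helper corresponds to the subtraction of the no-stimulation ΔF/F0 described in the text; how time points were aligned between the two recordings is not specified, so frame-by-frame pairing is an assumption.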


Statistics 


All data in bar and line graphs are expressed 
as means ± SEMs; n = 6 to 12 independent 
assays for all data. Student's t test was used to 
evaluate the statistical significance between 
two data sets. For multiple comparisons, one- 
way or two-way analysis of variance was used, 
followed by post hoc analyses by use of Prism 
software (GraphPad Software) or XLtoolbox 
(Free Software Foundation). Statistical sig- 
nificance was indicated as *P < 0.05, **P < 
0.01, and ***P < 0.001 and not significant for 
P > 0.05. 
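
As a small illustration of the summary statistics named above, the following sketch computes a mean ± SEM and maps a P value to the asterisk convention. The function names and example values are hypothetical; the authors used Prism and XLtoolbox, not this code, and treating P exactly equal to 0.05 as not significant is an assumption because the text does not cover that boundary.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) using the sample SD."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(n)


def significance_label(p):
    """Map a P value to the asterisk convention stated in the text."""
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return "not significant"


# Made-up performance indices from six independent assays.
pis = [55.0, 61.0, 48.0, 66.0, 59.0, 52.0]
m, sem = mean_sem(pis)
print(f"{m:.1f} ± {sem:.1f}", significance_label(0.004))
```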
Structural visualization of transcription initiation in action 


Xizi Chen†, Weida Liu†, Qianmin Wang†, Xinxin Wang†, Yulei Ren, Xuechun Qu, Wanjun Li, Yanhui Xu* 


INTRODUCTION: To achieve sophisticated transcription 
regulation, eukaryotic RNA polymerase II 
(Pol II) does not open the core promoter 
by itself and requires general transcription 
factors (GTFs) (~38 polypeptides in total) to 
assemble a preinitiation complex (PIC). The 
PIC then sequentially transforms into an 
open-promoter PIC complex (OC), an initially 
transcribing complex (ITC), and an early elongation 
complex (EEC). Previous studies reported that 
notable functional and compositional changes 
occurred during these processes. However, the 
underlying molecular mechanisms remain largely 
elusive. 


RATIONALE: We reconstituted the de novo tran- 
scribing complexes (TC2 to TC17) with Pol II 
halted on G-less promoters when nascent RNAs 
reach 2 to 17 nucleotides (nt) in length, re- 
spectively. We determined the cryo-electron 
microscopy (cryo-EM) structures and recapitu- 
lated the dynamic processes of transcription 
initiation by connecting structures of the PIC 
(Protein Data Bank ID 7EGB) and 16 TC 
complexes. 


RESULTS: Structural analyses reveal marked 
complex reorganizations. Starting from the PIC, 
the GTF TFIIH couples the energy of adenosine 
triphosphate (ATP) hydrolysis to translocate the 
downstream promoter to Pol II, resulting in an 
OC that contains a transcription bubble of 
~13 nt with the transcription start site (TSS) 
positioned at the active site. The OC then 
transforms into the ITC (TC2 to TC9), during 
which the upstream promoter and GTFs remain 
bound to Pol II, and thus the transcription 
bubble expands from 14 to 21 nt within the 
active center. The melted template strand guides 
the synthesis of nascent RNA, which increases 
its association with Pol II as it grows from 2 to 
9 nt, which is correlated with a transition from 
abortive transcription to escape commitment. 
The differences between TC9 and TC10 reveal 
a sharp ITC-EEC transition and suggest that 
nucleoside triphosphate (NTP)-driven RNA- 
DNA translocation and template-strand accu- 
mulation in a nearly sealed channel may balloon 
out the channel, resulting in GTF dissociation, 
bubble collapse, and promoter escape. The 
structures of TC10 to TC17 reveal EEC confor- 
mation, indicating that Pol II escapes the ini- 
tiation region and proceeds on the promoter, 
followed by pausing, RNA capping, pause- 
release, and productive elongation. 
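
The bubble sizes quoted above (14 nt at TC2, 21 nt at TC9) can be reproduced from the edge positions given later in the article, where the upstream edge is reported as fixed at position -12 and the downstream edge at position n+2 in the TCn complex. The sketch below is a consistency check only. It assumes the two edge positions are the flanking paired positions (so they are excluded from the count) and that promoter numbering has no position 0; both are assumptions, and the function names are hypothetical.

```python
UPSTREAM_EDGE = -12  # fixed upstream edge reported for TC2 to TC9


def downstream_edge(n: int) -> int:
    """Downstream bubble edge reported for the TCn complex (ITC range only)."""
    return n + 2


def bubble_size(n: int) -> int:
    """Count unpaired positions strictly between the two edges.

    Assumptions: edges are excluded from the count, and numbering runs
    ..., -2, -1, +1, +2, ... with no position 0.
    """
    if not 2 <= n <= 9:
        raise ValueError("edge rule is only described for TC2 to TC9")
    return sum(1 for pos in range(UPSTREAM_EDGE + 1, downstream_edge(n))
               if pos != 0)


for n in (2, 9):
    print(f"TC{n}: bubble of {bubble_size(n)} nt")
```

Under these assumptions the bubble grows by one nucleotide per added RNA nucleotide (size = n + 12), which matches the 14-to-21-nt expansion stated for the ITC.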


CONCLUSION: GTFs help Pol II open the core 
promoter through multiple interactions, 
which also create an obstacle to the escape 
of Pol II from the promoter and therefore 
must eventually be disrupted. Our study pro- 
vides structural visualization of transcrip- 
tion initiation in action for understanding 
why and how transcription machinery un- 
dergoes such notable changes. 
Structural visualization of transcription initiation in action 


Xizi Chen†, Weida Liu†, Qianmin Wang†, Xinxin Wang†, Yulei Ren, Xuechun Qu, Wanjun Li, Yanhui Xu* 


Transcription initiation is a complex process, and its mechanism is incompletely understood. We 
determined the structures of de novo transcribing complexes TC2 to TC17 with RNA polymerase II halted 
on G-less promoters when nascent RNAs reach 2 to 17 nucleotides in length, respectively. Connecting 
these structures generated a movie and a working model. As initially synthesized RNA grows, general 
transcription factors (GTFs) remain bound to the promoter and the transcription bubble expands. 
Nucleoside triphosphate (NTP)-driven RNA-DNA translocation and template-strand accumulation in a 
nearly sealed channel may promote the transition from initially transcribing complexes (ITCs) (TC2 to 
TC9) to early elongation complexes (EECs) (TC10 to TC17). Our study shows dynamic processes of 
transcription initiation and reveals why ITCs require GTFs and bubble expansion for initial RNA synthesis, 
whereas EECs need GTF dissociation from the promoter and bubble collapse for promoter escape. 


In eukaryotic cells, core promoters are made 
up of ~100 base pairs (bp), including elements 
that direct preinitiation complex (PIC) 
assembly and span the transcription start 
sites (TSSs) (1-3). Transcription initiation 
starts with the assembly of a PIC that consists 
of RNA polymerase II (Pol II) and general 
transcription factors (GTFs), including TFIIA, TFIIB, 
TFIID, TFIIE, TFIIF, and TFIIH (4-6). TFIID 
recognizes the core promoter through TATA box 
binding protein (TBP) and TBP-associated factors 
(TAFs) and recruits Pol II and other GTFs 
to assemble a closed-promoter PIC complex (CC). 
Adenosine triphosphate (ATP) hydrolysis by 
TFIIH results in the translocation of the downstream 
promoter to the Pol II active center. The 
PIC transforms into an open-promoter PIC complex 
(OC), which contains a transcription bubble 
with the template strand inserted into the Pol II 
active site to guide the synthesis of nascent RNA. 
Earlier structural studies revealed mechanisms 
of nucleoside triphosphate (NTP)-driven RNA 
growth and RNA-DNA translocation (7-13) 
as well as PIC assembly and promoter opening 
(14-24). 
Extensive biochemical studies have shown 
marked functional and compositional changes 
at the early stage of transcription initiation. 
Briefly, an initially transcribing complex (ITC) 
is formed when RNA synthesis starts, and abor- 
tive initiation can be observed when RNAs are 
2 or 3 nucleotides (nt) in length (25-28). Escape 
commitment has been proposed to occur when 
RNA reaches 4 nt long (29-33). When RNA is 
7 to 10 nt long, the transcription bubble ex- 
pands up to 17 or 18 nt, followed by an abrupt 
bubble collapse (33-37). Pol II will eventually 
break its contact with GTFs, escape from the 
promoter, and form an early elongation com- 
plex (EEC) (38, 39). Although structures of re- 
constituted ITCs have been reported (40-46), 
these complexes were not started from a PIC 
but instead were assembled on artificial RNA- 
DNA hybrids that were formed by a mismatched 
transcription bubble in which the template 
strand paired with a synthetic complementary 
RNA. A recent study captured a structure of a 
yeast (Saccharomyces cerevisiae) backtracked 
ITC complex that started from a PIC (47). How- 
ever, transcription initiation in yeast and mam- 
mals differs in various aspects. For example, 
yeast polymerase undergoes TSS scanning on 
the promoter for up to a hundred nucleotides 
(48, 49), which does not occur in mammals. 

Reported structures to date provide fragmented 
glimpses of complexes that have varying compo- 
sitions and different promoters. The present 
understanding of de novo transcription initia- 
tion in mammals, including GTF dissociation, 
bubble expansion and abrupt collapse, and the 
ITC-EEC transition, is limited by inconsistencies 
across various experimental contexts and the 
lack of continuously captured structures of tran- 
scribing complexes in action. Despite decades 
of biochemical and structural studies, the under- 
lying molecular mechanisms of these processes 
remain largely elusive. 
Results 

In vitro reconstitution of de novo transcription initiation 

To investigate de novo transcription initiation, 
we synthesized a series of modified super core 
promoters (SCPs) (Fig. 1A and data S1). Each 
promoter consists of an upstream TATA box, 
downstream TFIID-binding elements, and a 
G-less cassette between the putative TSS and a 
GG dinucleotide at varying positions on the 
nontemplate strand. The promoters are termed 
SCP^nG, with "n" denoting the G-stop position 
relative to the TSS, at which Pol II should stall if 
guanosine triphosphate (GTP) is not provided 
in the reaction. Mammalian Pol II and human 
GTFs were purified as previously described (15, 16) 
(fig. S1A). The reactions of de novo transcription 
initiation were performed on the SCP^3G 
to SCP^17G promoters in the presence of ATP, 
uridine triphosphate (UTP), [α-³²P]-UTP, cytidine 
triphosphate (CTP), and chain-terminating 
3'-O-methylguanosine-5'-triphosphate (3'-OMeGTP). 
Note that the reaction on SCP^2G was not performed 
because the generated AG dinucleotide would 
lack a radioactive U nucleotide and could not 
be detected. 
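
The G-stop design described above can be illustrated with a short sketch. It assumes the nascent RNA copies the nontemplate strand (with U in place of T) and ends at the first G, where the chain terminator is incorporated, which is consistent with the AG dinucleotide mentioned for the n = 2 promoter. The function names and the example sequences are hypothetical and are not the actual SCP sequences.

```python
def g_stop_rna(nontemplate_from_tss: str) -> str:
    """Return the RNA expected when transcription halts at the first G.

    `nontemplate_from_tss` is the nontemplate-strand DNA sequence starting
    at the TSS (+1). Assumption: the RNA matches this strand (T -> U) and
    ends with the chain-terminating G at the G-stop position.
    """
    seq = nontemplate_from_tss.upper()
    first_g = seq.find("G")
    if first_g == -1:
        raise ValueError("sequence has no G-stop site")
    return seq[:first_g + 1].replace("T", "U")


def has_labeled_u(rna: str) -> bool:
    """A product is visible with radiolabeled UTP only if it contains a U."""
    return "U" in rna


# Made-up cassettes with the first G at positions 2 and 6.
for dna in ("AGGTCA", "ACTCTGGAA"):
    rna = g_stop_rna(dna)
    print(dna, "->", rna, f"({len(rna)} nt, detectable: {has_labeled_u(rna)})")
```

In this sketch the n = 2 product contains no U, which mirrors the stated reason for omitting that reaction from the radiolabel assay.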

The reaction on each SCP^nG promoter resulted 
in a predominant G-stop RNA of the expected 
length (Fig. 1B; fig. S1, B to E; and supplementary 
text). Reaction on SCP^3G generated the highest 
level of G-stop RNAs among all the reactions, 
and reactions on SCP^3G to SCP^6G generated 
much more abundant G-stop RNAs compared 
with the reactions on SCP^7G to SCP^17G. This 
suggests that short (3 to 6 nt) “dead-end” 
G-stop products are unstable and disso- 
ciate more frequently from the halted Pol II, 
which allows for multiple cycles of tran- 
scription initiation. RNA products longer than 
6 nt remain stably bound to Pol II for up to 
one round of initiation. 

We observed two short RNA products of 
~2 to 4 nt in length (Fig. 1B; fig. S1, B to E; 
and supplementary text). The RNA products 
at the lower position largely accumulated in the 
presence of TFIIS (fig. S1D), which suggests that 
these short RNAs were derived from backtracked 
complexes, in which nascent RNAs backtracked 
to the active site by several nucleotides followed 
by RNA cleavage. A time-course reaction on 
SCP^17G showed an accumulation of RNAs at 
a position above the cleaved RNAs (fig. S1D), 
which suggests that these RNAs are abortive 
products that have dissociated from the tran- 
scribing complexes. Reactions on SCP"“ and G-less- 
containing adenovirus major late (AdML)°O°¢ 
promoters under similar conditions generated 
similar heterogeneous RNA products that re- 
sulted from abortive transcription, cleaved RNAs 
from backtracked complexes, alternative TSS 
selection, and readthrough transcription (Fig. 
1B and fig. S1, B to E). This result generally 
agrees with earlier in vitro transcription reactions 
on the AdML promoter (26-28, 33) and 
reported RNA heterogeneity in transcription 
initiation (50-52). 


Fig. 1. In vitro reconstitution of de novo transcribing complexes. (A) Schematic diagram of SCP^nG 
promoters. The numbers above the promoter represent the positions relative to the TSS (+1). The GG 
dinucleotide at position "n" and the G nucleotide at +19 provide the first and second (if readthrough occurs) 
G-stop sites for Pol II. TCn represents the assembled transcribing complex on the indicated SCP^nG promoter. 
The asterisk indicates that TC4 on SCP^4G was used for the transcription assay and TC4 on the SCP^4G mutant 
promoter was used for the structural study (fig. S1 and supplementary text). (B) In vitro transcription 
initiation assay. RNA products from one of the two independent experiments were visualized by 
autoradiography. The positions of abortive RNA and cleaved RNA from backtracked complexes are indicated. 
(C) Schematic diagram of the assembly of transcribing complexes. Transparent cryo-EM maps of the PIC and 
a representative TC show the changes in the promoter that is embedded within the complexes. 


Structure determination 


To visualize transcription initiation in action, 
we used SCP^2G to SCP^17G promoters to assemble 
de novo transcribing complexes termed TC2 to 
TC17, respectively (Fig. 1C). TC2 represents an 
ITC that forms the first RNA phosphodiester 
bond. TC17 was chosen as the last complex 
because a nascent RNA longer than 17 nt 
emerges from Pol II (53). TC4 was assembled 
on an SCP^4G mutant promoter that generated 
a better-resolved structure (see supplementary 
text). For each complex assembly, we first 
assembled a PIC on an SCP^nG promoter as 
previously described (15, 16). We then added ATP, 
CTP, UTP, and 3'-OMeGTP to start transcription 
initiation, which was followed by the addition 
of the mushroom toxin α-amanitin to inhibit 
the leaky activity of chain elongation. The reaction 
products were subjected to gradient fixation 
and cryo-electron microscopy (cryo-EM) struc- 
ture determination (fig. S2 and data S2 to S4). 
These transcribing complexes showed marked 
structural heterogeneity (fig. S2, data S2 to S4, 
and supplementary text), consistent with results 
from our in vitro assay (Fig. 1B and fig. S1, B to E) 
and earlier observations (30, 54-56). However, 
RNA heterogeneity in the reactions does not 
necessarily correlate precisely with complex 
heterogeneity for structural studies because the 
latter represents a snapshot of the reaction at 
a specific point in time. For example, we found 
neither abortive complexes nor backtracked 
complexes in our cryo-EM data processing, 
likely owing to the following reason. Cryo-EM 
samples were prepared by freezing reaction 
products 50 min after the addition of NTP. 
DNA translocation and RNA synthesis were 
further inhibited by the addition of 3'-OMeGTP 
and α-amanitin. By contrast, abortive and 
backtracked complexes were transient and unlikely 
to accumulate in the reactions. Even though 
small quantities of these complexes might exist 
in the samples, minor differences in nucleic acids 
were not sufficient to distinguish them from 
TC complexes in cryo-EM three-dimensional 
(3D) classification. 

To obtain structures at high resolution, we 
collected a large amount of data (580,409 images 
in total) with a 300-kV Titan Krios G3 micro- 
scope over ~110 days. Cryo-EM 3D classification 
and comparison defined particles that were 
used for the reconstruction of the TC2 to TC17 
complexes (summarized in data S4). Cryo-EM 
maps of most of the complexes revealed Pol II 
at a resolution of 2.7 to 3.3 Å and well-ordered 
RNA-DNA hybrids in the active center (fig. S3 
and data S2). Nucleic acids, Pol II, and TFIIF 
were modeled in all the complexes. Other 
GTFs were modeled by docking the structure 
templates into the cryo-EM maps of TC2 to 
TC9 but were not modeled in TC10 to TC17 
because of high flexibility. 


Overall structures of ITCs and EECs 


Dynamic processes of transcription initiation 
can be recapitulated by connecting structures 
of the PIC (16) and the transcribing complexes 
from TC2 to TC17 (Fig. 2A, Movie 1, and movie 
S1). As detailed later in the text and in Figs. 3 to 6, 
TC2 to TC9 represent ITCs correlated with abort- 
ive transcription and escape commitment. TC10 
to TC17 represent EECs correlated with pro- 
moter escape. The differences between TC9 
and TC10 suggest a sharp transition from the 
ITC to the EEC that is correlated with bubble 
collapse and GTF dissociation. 

Structures of TC2 to TC9 reveal a similar 
overall architecture of the ITC (Fig. 2, A to C; 
fig. S4; movies S2 to S4; and supplementary 
text). Similar to what occurs in the PIC (16),
GTFs remain bound to Pol II and the TATA
box is bent by TBP. The following promoter
region at −23 to −12 (−23/−12) is stabilized by
the TFIIB N-terminal core domain (B-core-N),
the TFIIB linker (B-linker), the TFIIF winged-helix
domain (WH), and the Pol II protrusion.
The upstream edge of the transcription
bubble remains fixed at position −12 throughout
TC2 to TC9. By contrast, the downstream
edge of the transcription bubble is located at 
position n+2 in the TCn complex, which is 
Fig. 2. Structures of the ITC and EEC. (A) Overall cryo-EM maps of TCn complexes and close-up views of nucleic acids. The color scheme is indicated at the
bottom of the panel. The directions of strand translocation are indicated with arrows. The middle arrows indicate the functional states of Pol II and the transcription
bubble. (B) Locally refined cryo-EM maps of Pol II, including Pol II, nucleic acids, and closely bound GTFs, in representative ITC and EEC complexes. (C and D) Close-up
views of ITC (C) and EEC (D) structural models show protein-promoter interactions (top) and promoter conformations (bottom).


indicative of the continuous insertion of the 
downstream promoter into the active center. 
The promoter downstream of n+2 remains a 
straight double helix and is tilted by ~50° 
relative to the upstream duplex. The down- 


stream promoter at n+2/n+11 inserts into the 
open DNA entry channel (channel^D). TFIIH binds
the downstream promoter at n+22/n+31, con- 
sistent with its role early in transcription ini- 
tiation (25, 57-61), which is dependent on the 


Chen et al., Science 382, eadi5120 (2023) 22 December 2023 


downstream promoter regions (62). Nascent 
RNA pairs with the template strand to form 
an RNA-DNA hybrid in the active center. 
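
The promoter landmarks quoted above for an ITC can be collected as a function of the RNA length n. The sketch below is illustrative only: the function name `itc_landmarks` and the dictionary keys are hypothetical labels, and the values simply restate the positions given in the text (relative to the TSS at +1).

```python
def itc_landmarks(n: int) -> dict:
    """Return promoter positions quoted in the text for an ITC with an n-nt RNA (TCn).

    Hypothetical helper for illustration; positions are relative to the TSS (+1).
    """
    if not 2 <= n <= 9:
        # The text assigns TC2 to TC9 to the ITC; TC10 to TC17 are EECs.
        raise ValueError("the text describes only TC2 to TC9 as ITCs")
    return {
        # Region stabilized by B-core-N, B-linker, the TFIIF WH domain, and the protrusion
        "gtf_stabilized_upstream_region": (-23, -12),
        # Upstream bubble edge stays fixed throughout TC2 to TC9
        "bubble_upstream_edge": -12,
        # Downstream bubble edge advances with RNA length
        "bubble_downstream_edge": n + 2,
        # Duplex segment inserted into the DNA entry channel
        "dna_entry_channel_segment": (n + 2, n + 11),
        # Segment of the downstream promoter bound by TFIIH
        "tfiih_binding_segment": (n + 22, n + 31),
    }


if __name__ == "__main__":
    for n in (2, 3, 9):
        print(n, itc_landmarks(n))
```

Only the downstream landmarks depend on n, which reflects the continuous insertion of the downstream promoter while the upstream contacts stay fixed.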
Cryo-EM maps of TC10 to TC17 show similar 
organization of the EEC (Fig. 2, A, B, and D; 
Movie 1. A movie of transcription initiation. The movie was generated by connecting structures of the
PIC (16) and TC2 to TC17 and shows the movements of Pol II relative to the upstream promoter. The image
on the left shows the full complex. The images at the upper right show an omitted structure and a cross- 
sectional view of the complex. The image at the bottom right shows nucleic acids. The color scheme is also 
shown at the bottom right. Brief descriptions of each state are included at the bottom of the screen. 


fig. S5; and movies S2 to S4). The RNA-DNA 
hybrid was well stabilized within the Pol II 
active center, and a weak density of flanking
RNA was observed in an open RNA exit channel
(channel^R). The flanking upstream and downstream
double-stranded promoter regions are
flexible owing to the lack of stabilization by
GTFs, which nearly dissociate from the promoter
(Fig. 2B and fig. S5A). The TFIIF dimerization
module and TFIIE remain bound to Pol
II. The remaining GTFs, including TFIID (with 
TBP), TFIIA, TFIIB, and TFIIH, are either com- 
pletely dissociated from or flexibly tethered to 
Pol II through the associations of TFIIF, TFIIE, 
and/or the exposed promoter elements. The 
flexibility of GTFs in the EEC may also explain 
seemingly inconsistent observations of GTFs 
on transcribing complexes, which vary under 
different experimental conditions (38, 39, 63-65). 
Thus, Pol II is no longer restrained by the 
upstream promoter-bound GTFs and is free to 
move forward and escape the promoter. Here, 
GTF dissociation refers to a dissociation from 
the promoter but not from Pol II. 


The transition from the PIC to the early ITC 


Compared to the PIC (16), the ITC shows the
following structural differences (Fig. 3, Movie 1,
fig. S6, and movie S1): The downstream promoter
is tilted by ~30° relative to that of the
PIC, leading to slight displacement and rotation
of the TFIIH core module and the TFIIH-associated
TFIID away from Pol II. The cyclin-dependent
kinase (CDK) activating kinase
(CAK) module of TFIIH was not observed, in
agreement with the requirement that the
TFIIH core be in a proper position for loading
CAK to Pol II (16). The Pol II active center is
occupied by the melted promoter, and the ITC
exhibits a slight expansion in the direction of
the DNA axis while becoming more compact 
in the vertical direction (as in Fig. 3B). TBP, 
the B-core-N, and the Pol II protrusion are 
slightly displaced toward the upstream direc- 
tion, whereas the Pol II lobe moves toward the 
downstream direction. The clamp head and 
clamp helices move toward the lobe by up to 
11 Å. The clamp-associated rudder and B-linker
are more ordered and may contribute to stabi- 
lizing the melted promoter. These domains and 
motifs would generate steric clashes with a closed 
promoter, which suggests their roles in maintain- 
ing the promoter in an open conformation. 
Early-stage transcription initiation is reca- 
pitulated by connecting structures from the 
PIC to TC3 (Figs. 3C and 6, Movie 1, and movie 
S1). Because Pol II by itself does not appear to 
efficiently open linear promoter DNA, the DNA 
translocase XPB of TFIIH couples the energy of
ATP hydrolysis to rotation and translocation
of the downstream promoter to Pol II, resulting
in promoter melting and displacement of the
TSS by ~45 Å. The downstream promoter inserts
into Pol II by translocation of ~16 bp, as
measured by the displacement of the Pol II 
jaw-contacting point of the promoter, which 
is +5 in the PIC and +20 in TC3. This suggests 
an estimated insertion of 13 or 14 bp during 
the transition from the PIC to the OC. TFIIH- 
generated torque for promoter melting would 
be counteracted by rotation and translocation 
of the upstream promoter if it were not re- 
strained otherwise by Pol II-bound B-core-N, 
TBP, and WH. Thus, extensive interactions
among promoter, Pol II, and GTFs provide an 
integrated framework to open the promoter 
and maintain the open conformation. After OC 
formation, the melted template strand guides 
Pol II to synthesize the first phosphodiester
bond of nascent RNA, followed by cycles of 
nucleotide addition, RNA chain extension, and 
RNA-DNA hybrid translocation (66, 67). 


TFIIH is required for efficient 
transcription initiation 


TFIIH is required for transcription initiation
(58, 61) but may be dispensable for promoter 
opening or transcription initiation under cer- 
tain experimental contexts (24, 68-70). To test 
the effect of TFIIH in transcription initiation, 
we performed an in vitro transcription assay 
on representative templates (SCP*¢, SCP°S, 
scp?°?, and SCP!”°) in the presence and 
absence of TFIIH individually (fig. S1E). 
TFIIH supported the generation of G-stop RNAs, 
whereas the reactions in the absence of TFIIH
showed largely decreased amounts of G-stop
RNAs. Thus, TFIIH is required for efficient
in vitro transcription initiation. Transcription
initiation did occur in the absence of TFIIH,
albeit much less efficiently. The results may
reconcile earlier studies that appeared to
conflict and suggest a context-dependent
effect of TFIIH on transcription.

The absence of TFIIH led to only a slight
decrease in the level of abortive RNAs, in
sharp contrast to the largely decreased level
of G-stop RNAs, which is indicative of an increase
in the abortive/productive ratio (fig. S1E).
Such an increased abortive/productive ratio
suggests that TFIIH may not only facilitate 
promoter opening but also prevent bubble 
reannealing (leading to RNA dissociation) in 
the early stage of transcription initiation. In 
the absence of TFIIH, nascent RNAs are more 
inclined to dissociate from Pol II to become 
abortive RNAs rather than productive RNAs. 
This observation agrees with the proposed 
function of TFIIH in pumping the downstream
promoter to the active site, potentially aiding 
the formation and stabilization of the tran- 
scription bubble (25, 57-62). This phenome- 
non also accounts for the discrepancy between 
eukaryotic Pol II, which exhibits a much lower 
abortive/productive ratio, and bacterial RNA 
polymerase (RNAP), which has no TFIIH 
equivalent and is characterized by a high 
abortive/productive ratio (71, 72). 


RNA-DNA hybrid stability is correlated with 
abortive transcription, escape commitment, and 
promoter escape 


Throughout all the structures, Pol II adopts a 
posttranslocation state with 3′-OMeGMP at the 3′
end of the RNA paired with the first downstream
cytosine at position +n of the template
strand (Fig. 4, A and B; figs. S7 and S8; and
supplementary text). At the nucleotide addition
(A) site, a free 3′-OMeGTP pairs with the
second downstream cytosine at position n+1 
of the template strand. The melted template 
strand exhibits a helical fold with phosphate 
Fig. 3. The transition from the PIC to the early ITC. (A) Cryo-EM maps of
the PIC (16) (top left) and a representative early ITC (TC3) (top right) shown with
Pol II in a similar orientation. Close-up views of the promoters in the closed
(bottom left) and open (bottom right) conformations are also shown. CAK is
positioned on the Pol II surface in the PIC but is absent in TC3, as indicated by
the white dashed oval. Although TC2 is almost identical to TC3, it has a less-stable
RNA and thus was not used for comparison. (B) Superimposition of
the PIC and TC3 structures with conformational differences indicated by


backbone packing against a positively charged 
path of the active center cleft. In TCn struc- 
tures, the RNA-DNA hybrids within the active 
center adopt a conformation similar to that of the 
elongation complex (EC) (fig. S8, B and C). Con- 
formational variations exist at the single-stranded 
template strand outside of the active center and 
the 5’ flanking nucleotides of nascent RNA, which 
are flexible owing to the lack of stabilization. 

RNA density is weak in TC2 and TC3 (Figs. 2A 
and 4B and movie S4), consistent with extensive 
abortive transcription on the scp?¢ promoter 
(Fig. 1B and fig. S1B). The short dead-end RNAs 
are prone to dissociate from the halted Pol II as 
a result of less stabilization by the template 
strand and Pol II. RNA dissociation may allow 
the unpaired template strand to retreat and 
guide a new cycle of RNA synthesis. 


P, Pro; and T, Thr. 


As the length of the RNA increases in com- 
plexes TC4 to TC9, cryo-EM maps of the RNA- 
DNA hybrid tend to be better resolved (Figs. 
2A and 4B and movie S4), which is indicative 
of a metastable RNA-DNA hybrid correlated 
with escape commitment and a marked de- 
crease in abortive initiation (29-33) (Fig. 1B 
and fig. S1B). Longer RNA makes more con- 
tact with the template strand and the Pol II 
active center cleft, which is rich in positively 
charged residues. We did not observe con- 
formational differences in the Pol II active 
centers of TC2 to TC9. Thus, the transition 
from abortive transcription to escape com- 
mitment may result from an increase in 
RNA length rather than the proposed con- 
formational changes of the transcribing 
complex (31). 
arrows, from PIC to TC3 (right). Close-up views of the differences are shown
on the left. The circular close-up view shows fork loop 2 near the active site,
which moves toward the bridge helix to avoid a clash with the melted
nontemplate strand. (C) A model of the PIC-ITC transition, with molecular
motions indicated with arrows. The closed promoter and XPB subunit in the
PIC are colored gray. Molecules in the ITC are colored as indicated. Single-letter
abbreviations for the amino acid residues are as follows: G, Gly; I, Ile;


TC10 to TC17 structures reveal well-ordered 
10-bp RNA-DNA hybrids in the active center 
similar to those of EC structures (11, 73), consistent
with stable RNA binding and promoter
escape (Fig. 2A, figs. S5 and S8, and
movie S4). The nascent RNA protrudes out
of the active center, winds through the open
channel^R, and touches the Pol II dock (fig. S5C).
Cryo-EM maps show the growth of RNA in
TC10 to TC13 (fig. S5F). However, only 13 nt
could be visualized in TC14 to TC17, and the 5’ 
flanking nucleotides were flexible owing to the 
lack of stabilization. 


The B-finger touches the 5′ end of nascent
RNA in the early ITC 


Cryo-EM maps of the early ITC show a putative
TFIIB finger (B-finger, residues 50 to 72,
also known as B-reader) in the Pol II active
center (Fig. 4, B and C; and fig. S9), which has
been biochemically detected (74, 75) and structurally
observed (24, 40, 43-46). The B-finger
of TC3 contacts the template strand at position
−2/−3 and touches the 5′ end of nascent
RNA, showing an insertion in the active center
deeper than that of the previously reported
structures (40, 43-46). The B-finger in TC2 to
TC6 is more separated from the active site and 
less stable as the RNA chain grows and is in- 
visible in TC7 to TC9, which suggests a gradual 


Fig. 4. RNA-DNA hybrid and the B-finger in the 
active center. (A) Cryo-EM maps of the active 
center of TC5 and TC10, which are representative 
ITC and EEC complexes, respectively. Because the 
active center is deeply buried, surrounding modules 
are simplified as background. Close-up views show 
RNA-DNA hybrid and Pol II residues that contribute
to hybrid stabilization. Nucleic acids of the template 
strand and nascent RNA are numbered relative to 
the TSS. (B) Transparent cryo-EM maps of the 
RNA-DNA hybrid (left) and B-finger (right) in TC2 to 
TC9. Cryo-EM maps of the B-ribbon and B-linker are 
shown to mark the B-finger boundaries [residues 
Ser50 (S50) and Asn72 (N72)] that remain similarly
positioned throughout TC2 to TC9. The structural 
model of the B-finger was not built, and a putative 
trace is shown instead. Cryo-EM density of the 
B-finger is more upwardly positioned from TC2 to 
TC6 and is invisible in TC7 to TC9. (C) Cryo-EM 
map of the B-finger and structural model of the 
RNA-DNA hybrid in TC3. Single-letter abbreviations 
for the amino acid residues are as follows: A, Ala; 
D, Asp; H, His; K, Lys; L, Leu; M, Met; Q, Gln; 

R, Arg; S, Ser; T, Thr; V, Val; and Y, Tyr. 
Fig. 5. Template strand accumulation in channel^T and the ITC-EEC transition.
(A) Cross-sectional views of the open channel^NT of TC3 (left), the narrow channel^T of
TC3 in two different views (middle), and the expanded channel^T of TC10 (right).
Proteins and nucleic acids are shown in surface and cartoon representations,
respectively. The active center and channel^T are circled by white dashed lines. The
invisible regions of the nontemplate strand are shown as dashed lines. (B) Cross-sectional
views of channel^T in TC2 to TC10 for comparison. DNA and RNA are
shown in stick representation. The channel^T is narrow in TC2 to TC9 and largely
expanded in TC10. Accumulation of the disordered template strand in channel^T of
the ITC (TC4 to TC9) is diagramed with green transparent nucleic acids. Each
nontemplate strand is shown as a dashed line, and channel^NT is covered in this view.
Cryo-EM maps of nucleic acids are shown in fig. S10C. (C) Estimated dimensions
of channel^T and nucleotides for comparison. Channel^T is shown in three different views.
The blue dashed arrow indicates the position of the B-linker in (D). (D) Destabilization
of the B-linker at the gate of channel^T in TC9. Cryo-EM maps of the B-linker of TC3, TC5,
TC7, and TC9 are shown in comparable contour levels and are colored in green.


retreat out of the active center. As evidenced
by its reported effects on transcription activity
and TSS selection (including TSS scanning in
yeast) (35, 52, 76-80), the placement of the
B-finger in the early ITC suggests its supportive
role in efficient initiation and promoting
the stability of early ITCs.


Exit channels of the template and
nontemplate strands


The melted template and nontemplate strands
occupy otherwise empty channels, termed channel^T
and channel^NT, respectively (Fig. 5 and fig. S10).
As exemplified by TC3 and TC17 structures,
the weak but noticeable density of the nontemplate
strand winds over an open channel^NT
formed by the Pol II lobe, clamp, and protrusion,
which is generally similar to that of the
Pol II EC (11, 73) and the Pol III pretermination
complex (81). Consistent with the architecture
of the open channel^NT, the melted nontemplate
strand is highly flexible and less restricted.
This is a conserved feature in eukaryotic Pol
II and differs from bacterial RNAP, in which
the σ factor restrains the nontemplate strand
and regulates transcription initiation (82, 83).

The cross-sectional view of the ITC shows
that the active center merges with the narrow
channel^T formed by the B-core-N, the B-linker,
and the Pol II active center cleft, hybrid binding,
protrusion, rudder, lid, wall, and clamp
helices (Fig. 5, A and B; and fig. S10C). The
channel^T is nearly sealed, with two ends capped
by the RNA-DNA hybrid and the upstream DNA
duplex. The narrow channel^T may support the
placement of the template strand in the active
center that is necessary for RNA synthesis in the
early ITC. Compared with the empty channel^T
of the PIC, the DNA-occupied channel^T of the
ITC is slightly enlarged by the displacement of
Pol II protrusion and B-core-N (Fig. 3B), which
suggests an adjustable size based on the molecules
inside.

In the EEC, TFIIB is invisible and the
channel^T is no longer sealed and serves as a wide
and short extension of the active center (Fig. 5, A
and B; and fig. S10C). The expanded channel^T
is similar to that of the Pol II EC (11, 73) (fig.
S5E). The transcription bubble is maintained
at a fixed length of 11 nt, with the upstream
(n−10) and downstream (n+2) edges stabilized
by the Pol II rudder and bridge helix, respectively
(Figs. 2, A and C, and 5; and fig. S4).


The transition from the ITC to the EEC is 
correlated with GTF dissociation, bubble 
collapse, and promoter escape 


Comparison of the TC9 and TC10 structures 
with Pol II superimposed suggests considera- 
ble conformational changes in the ITC-EEC 
transition (Figs. 2 and 6, Movie 1, fig. S11, and 
movie S1). In TC10, the channel^T is largely expanded,
and extensive contacts between GTFs
and Pol II are lost. The template and nontemplate
strands at −11/−1 collapse into a straight DNA
duplex that merges with the upstream pro- 
moter. Pol II is more separated from the TATA 
box, which is likely unbent because Pol II and 
TFIIB are necessary for stable promoter bending
(16). When the upstream promoter is superimposed
as a fixed reference, Pol II undergoes
considerable rotation and displacement by 
Fig. 6. A working model of transcription initiation. The transitions from the PIC (16) to the ITC and to the EEC are shown in cryo-EM maps of full complexes (left),
simplified complexes (middle), and schematic models (right) with short descriptions. T and NT represent template and nontemplate strands, respectively. TBS
represents the TBP-binding sites on TATA box and TATA-less promoters. The complexes from top to bottom are shown with the upstream promoter aligned to
indicate a transcribing Pol II relative to the promoter. The open PIC and ITC-EEC transition schematics indicate proposed states by connecting the related structures.


~110 Å, which suggests that a process of promoter
escape occurred on an active gene. As a
result, Pol II proceeds on the promoter and the 
complex turns into an EEC. 

The above result generally agrees with earlier 
biochemical studies that are not fully consistent 
with each other. For example, by detecting
changes in the transcription bubble, complex
stability, or dissociation of GTFs (mainly TFIIB),
previous studies showed that the functional
transition occurs when RNA reaches 7 (35),
8 (34), 9 (37), 10 (33), 7 or 9 (38), 12 or 13 (63),
or more than 9 or 10 nt (36, 57). This apparent
discrepancy may result from the heteroge- 
neity of transcription initiation, variations in 
promoters and experimental contexts, and 
limitations of the detection method (see sup- 
plementary text). 



To test the ITC-EEC transition on another 
promoter, we assembled TC9 and TC10 on the 
AdML’S and AdML?°S promoters, respectively, 
and determined cryo-EM structures (fig. S12 
and data S2 to S4). The two structures are 
similar to their counterparts on the SCP°¢ and 
SCP? promoters, respectively, which suggests 
a common point of conformational transition. 
Thus, comparing TC9 and TC10 structures in a 
similar experimental context may serve as a 
framework to unite previous studies and pro- 
vide structural insights into the dynamic pro- 
cesses of the ITC-EEC transition, a rate-limiting 
step in transcription initiation (30, 32, 34). 


GTFs remain associated with the early EEC, and 
TFIID dissociates from the late EEC 


Cryo-EM maps of the EEC show the density of 
the Pol II-associated TFIIF and TFIIE (fig. S5B).
As previously reported, TFIIE and TFIIF in the 
EEC are incompatible with DRB sensitivity- 
inducing factor (DSIF) in a paused elongation 
complex (PEC) (84) and an elongation factor 
super elongation complex (SEC) in SEC-Pol II 
complex (85). Thus, the retained TFIIE and
TFIIF may function not only in supporting 
initiation but also in preventing the negative 
effects of pausing and elongation factors before 
the transition from the EEC to the PEC or the 
productive EC. 

To investigate whether other GTFs dissociate 
from transcribing complexes, we assembled 
complexes of TC3 (early ITC), TC9 (late ITC), 
TC10 (early EEC), and TC17 (late EEC), fol- 
lowed by glycerol gradient ultracentrifugation 
(fig. S13A). Peak fractions (as in cryo-EM sam- 
ple preparation) of each complex were sub- 
jected to Western blotting using antibodies 
against representative Pol II and GTF subunits, 
including RPB2 (Pol II subunit), TFIIB, TAF1 
and TBP (two TFIID subunits), TFIIEβ, TFIIFα,
and XPB (TFIIH subunit). All the GTFs maintained
their association with both the early ITC
and the late ITC in consistent stoichiometry. 
TFIID partially dissociated from the early EEC, 
whereas other GTFs remained in a stable asso- 
ciation. In the fractions of the late EEC, TAF1 
and TBP of TFIID were nearly undetectable, 
and there appeared to be a reduction in the 
levels of TFIIB, TFIIF, TFIIE, and TFIIH.

The above results agree with our structural ob- 
servation and provide complementary insights 
into GTFs in transcribing complexes (fig. S13B). 
TFIID seems to bind the exposed downstream 
promoter elements (DPEs) (positions 28 to 32) 
in TC10, which become buried within Pol II 
channel^D and inaccessible to TFIID in TC17.
TFIID in the apo form may bind TBP and in- 
hibit its binding to the promoter (6, 86, 87). 
TFIIH may tether with Pol II through the bind- 
ing of TFIIE and TFIID. TFIIB may disengage 
from Pol II while maintaining partial connections
with TBP, TFIIF, or the TFIIB-recognition
element (BRE) of the upstream promoter. The
presence of GTFs within the EEC could potentially
vary across different promoters owing to
differences in GTF-binding elements. Their re- 
tention may be regulated by elongation factors, 
Mediator, CTD phosphorylation, or chroma- 
tin. Nevertheless, the GTFs that are associated 
with the EEC in peripheral positions might 
not contribute to the transcription reaction. 


Template-strand accumulation in channel^T and
NTP-driven translocation may promote the 
ITC-EEC transition 


The ITC-EEC transition may result from rel- 
ative motion between nucleic acids and Pol II, 
a process of RNA-DNA translocation that is 
coupled to NTP loading, pyrophosphate release, 
and conformational changes of the bridge helix 
and trigger loop (7, 8, 13, 66, 67). We next in- 
vestigated the correlation between RNA-DNA 
translocation and the ITC-EEC transition. Cryo- 
EM maps of TC2 and TC3 revealed the contin- 
uous density of template strands of 14 and 15 nt 
in length, respectively, spanning the upstream 
and downstream edges of the transcription 
bubble (Figs. 2A and 5B and fig. S10C). How- 
ever, in TC4 to TC9, although the template 
strand becomes longer, only ~10 nt within the 
active center could be modeled, whereas 6 to 
11 nt (in TC4 to TC9, respectively) outside of 
the active center are invisible. The disordered 
template strand does not appear to squeeze 
out of the nearly sealed channel^T, and its flexibility
suggests nonspecific placement of the
unpaired nucleotides in channel^T. In TC9, the
Pol II active center is occupied by the 9-bp RNA-DNA
duplex, and the 11-nt single-stranded template
strand (potentially together with the B-finger)
likely accumulates in channel^T. The irregular
channel^T measures ~10 Å by
15 Å by 20 Å and may not accommodate single-stranded
DNA longer than 11 nt (Fig. 5C).
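
The nucleotide counts quoted above are consistent with simple position arithmetic. The sketch below is an illustration, not part of the analysis in the paper. It assumes that promoter numbering skips 0 (−1 is followed by +1), that the melted template length is the number of positions strictly between the upstream bubble edge (−12) and the downstream edge (n+2), and that exactly 10 nt of template are ordered in the active center (the text says ~10). All function and constant names are hypothetical.

```python
ITC_UPSTREAM_EDGE = -12        # fixed upstream bubble edge in TC2 to TC9 (from the text)
ACTIVE_CENTER_ORDERED_NT = 10  # assumption: the "~10 nt" modeled in TC4 to TC9


def positions_between(upstream: int, downstream: int) -> int:
    """Count promoter positions strictly between two edges, skipping position 0."""
    count = downstream - upstream - 1
    if upstream < 0 < downstream:
        count -= 1  # there is no position 0 between -1 and +1
    return count


def itc_melted_template_nt(n: int) -> int:
    """Melted template length in an ITC with an n-nt RNA (downstream edge at n+2)."""
    return positions_between(ITC_UPSTREAM_EDGE, n + 2)


def disordered_template_nt(n: int) -> int:
    """Template nucleotides left outside the active center under the 10-nt assumption."""
    return itc_melted_template_nt(n) - ACTIVE_CENTER_ORDERED_NT


if __name__ == "__main__":
    for n in range(2, 10):
        print(f"TC{n}: melted template {itc_melted_template_nt(n)} nt")
    for n in range(4, 10):
        print(f"TC{n}: disordered outside active center {disordered_template_nt(n)} nt")
```

Under these assumptions, TC2 and TC3 give 14 and 15 nt, TC4 to TC9 leave 6 to 11 nt outside the active center, and TC9 reaches 21 nt, matching the values reported in the text.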
The above analysis suggests a template strand-scrunching
mechanism for the transition from
the ITC to the EEC (Fig. 5B, Movie 1, fig. S10C, and 
movie S1). In the late ITC, RNA chain extension 
and RNA-DNA translocation lead to the accu- 
mulation of the template strand in the narrow 
channel^T. The addition of one more nucleotide
to the RNA chain and NTP-driven RNA-DNA
translocation may provide a final push to balloon
out channel^T. Because Pol II remains rigid,
the B-linker and B-core-N dissociate from Pol II 
to release the accumulated template strand. 
In support of this hypothesis, an earlier study 
showed a decrease in the complex stability on 
the AUML™ to AdML”* promoters (35). Cryo-EM 
density of the B-linker at the gate of channel^T
is continuous in TC3 and TC5 but becomes less
ordered and disconnected in TC7 and TC9,
which indicates the destabilization of channel^T
(Fig. 5D). Moreover, TFIIB dissociates from Pol 
II before RNA reaches 10 nt in length (38, 39). 
Previous studies in yeast Pol II have proposed 
that steric clashes between RNA and the B-ribbon
may lead to TFIIB dissociation (44, 46).
This is not likely the case in mammals because 
RNA shorter than 12 nt does not reach the B- 
ribbon (fig. S5D). 

More supporting evidence comes from a 
comparison of the mammalian ITC structures 
with the structure of the yeast ITC on a G-less 
promoter (G is at +26) (47) (fig. S6 and supple- 
mentary text). The yeast ITC structure repres- 
ents a backtracked complex containing a 5-nt 
nascent RNA in the active center. The structure
shows a partially open channel^T, which may
allow the template strand of more than a hundred
nucleotides to thread out of the complex
for TSS scanning. The differences in channel^T
(sealed versus open) and the template strand
(which is melted within the catalytic pocket by up
to 21 nt in the mammalian ITC versus threaded
out of the catalytic pocket by more than a hundred
nucleotides in the yeast ITC) highlight the
regulatory role of channel^T in potentially controlling
bubble size in mammals and suggest
distinct underlying mechanisms of transcrip- 
tion initiation in mammals and yeast (48, 49). 


Comparison with transcription initiation in bacteria 


Our proposed mechanism of transcription ini- 
tiation in mammals is generally similar to the 
well-characterized transcription initiation by 
bacterial RNAP, which forms CC, OC, ITC, and 
EC complexes and undergoes promoter open- 
ing, bubble expansion, bubble collapse, and 
promoter escape through a DNA-scrunching 
mechanism (88-91). The bacterial σ factor, which
is functionally equivalent to eukaryotic GTFs,
associates with RNAP and stabilizes the upstream
promoter in the ITC. During promoter escape,
the σ factor is forced to dissociate from RNAP
by the scrunched DNA in the transcription bubble
(92) and by the σ-finger, which is compressed
by the growing RNA (93-95). Distinct from what occurs
in the mammalian system, the bacterial
σ factor recognizes the single-stranded nontemplate
DNA and contributes to promoter
escape (82, 83), which varies substantially
depending on the type of promoter (71, 72).


Discussion 


Studies over the past three decades have estab- 
lished that there are functional transitions during 
eukaryotic transcription initiation. However, 
understanding this process at the molecular 
level has been a long-standing challenge because 
of the difficulty in obtaining de novo transcrib- 
ing complexes, which are highly dynamic and 
heterogeneous (30, 54-56). Our work reconciles 
earlier studies and provides structural visualiza- 
tion of transcription initiation in action (Fig. 6, 
Movie 1, and movie S1). As detailed above, the 
transition from the PIC to the early ITC involves 
promoter melting, transcription bubble for- 
mation, nascent RNA synthesis, and abortive 
transcription. From the early ITC to the late 
ITC, the transcription bubble expands and RNA 
is more stable within Pol II, which is correlated
with escape commitment. From the late ITC to
the EEC, the complex undergoes abrupt transitions,
including channel^T expansion, GTF dissociation,
bubble collapse, and promoter escape.
The proposed model may be applicable to 
Pol II-mediated transcription initiation on TATA- 
less promoters, which account for most core 
promoters (7). TATA-less promoters tend to 
exhibit lower transcriptional activity than TATA 
box promoters and often result in heteroge- 
neous TSSs, limiting their usage in biochemical 
and structural studies. Our previous study 
showed that during PIC assembly, TFIID sup- 
ports TBP to load and bend the upstream pro- 
moter region, termed the TBP-binding site (TBS), 
regardless of whether it contains a TATA box 
or not (16). PIC complexes assembled on the
TATA box and TATA-less promoters adopt 
similar organizations, which suggests a general 
mechanism of transcription initiation. Although
the selection of a TSS is fine-tuned by the
surrounding sequences and the inherent heterogeneity
(50-52), transcription seems to
start ~31 bp downstream of the TBS (35, 48).
The TSS is therefore primarily determined by the site of
PIC assembly, which in turn depends on promoter
sequence and chromatin organization (1, 14).
Our study also provides a structural basis for 
understanding why transcription initiation 
undergoes such notable changes. To avoid pro- 
miscuous transcription, Pol II by itself could 
neither find a target promoter nor open a pro- 
moter to initiate RNA synthesis. This is why 
GTFs are necessary for promoter recognition 
and transcription initiation. However, the sta- 
bility that results from the multiple interactions 
among GTFs, Pol II, and the promoter also 
creates an obstacle for Pol II to escape the pro- 
moter and therefore must eventually be dis- 
rupted. The modular organization of the PIC 
and the ITC is well suited to achieve this pur- 
pose. In the early ITC, the active center and 
channel’ can accommodate the template strand 
and allow for bubble expansion and initial 
RNA synthesis. In the late ITC, the RNA-DNA 
hybrid is stable and long enough to support 
NTP-driven translocation and disruption of the 
GTF-Pol II-promoter contacts. In the EEC, the
peripherally positioned GTFs would not restrain 
Pol II on the promoter. Therefore, coordinated 
changes of the GTFs and the transcription bub- 
ble support a functional transition from the ITC, 
which requires GTFs and promoter opening for 
initial RNA synthesis, to the EEC, which requires 
GTF dissociation and bubble collapse for promoter
escape and stable RNA production.


Materials and methods 

Cryo-EM sample preparation 

Pol II was isolated from Sus scrofa thymus, 
and Homo sapiens TFIID, TFIIA, TFIIB, TFIIF, 
TFIIE, and TFIIH were prepared as previously
described (15, 16). Promoter sequences (SCP™* 
and AUML°°°) are listed in data S1. The promoter
DNA was generated by annealing the
template strand with equimolar amounts of
single-stranded nontemplate DNA at a final
concentration of 50 µM in water. The annealing
reaction was carried out at 95°C for 10 min and 
gradually cooled down to 25°C within 2 hours. 

To prepare each TC complex, the purified 
PIC components of 300 pmol of TFIID, 1500 
pmol of TFIIA, and 450 pmol of promoter DNA 
(SCP?¢ and AUML9@°S) were mixed at 4°C for 
2 hours in a 150-µl mixture containing 25 mM
HEPES-KOH pH 7.9, 100 mM KCl, 6 mM MgCl2,
2 mM DTT, and 5% (v/v) glycerol. After incubation,
the mixture was subjected to glycerol
gradient ultracentrifugation. The gradient was 
generated from equal volumes of a 10% glycerol 
solution [10% (w/v) glycerol, 30 mM HEPES-KOH 
pH 7.9, 50 mM KCl, 6 mM MgCl2, 2 mM DTT]
and a 50% glycerol solution [50% (w/v) glycerol,
30 mM HEPES-KOH pH 7.9, 50 mM KCl, 6 mM
MgCl2, 2 mM DTT] with a BioComp Gradient
Master 108 (BioComp Instruments). The ultracentrifugation
was performed using a swinging
bucket rotor SW60Ti (Beckman) at 36,000 rpm
for 14 hours at 4°C. Peak fractions containing 
promoter DNA-TFIID-TFIIA were pooled and 
concentrated to ~1.5 mg/ml. Subsequently,
100 pmol of the preassembled promoter DNA-TFIID-TFIIA
was mixed with 500 pmol of TFIIB,
110 pmol of Pol II, and 160 pmol of TFIIF at 4°C
for 1 hour in a 150-µl mixture containing 25 mM
HEPES-KOH pH 7.9, 75 mM KCl, 6 mM MgCl2,
2 mM DTT, and 5% (v/v) glycerol. Next, 300 pmol
of TFIIE and 200 pmol of TFIIH were added
to the mixture, and it was further incubated
for 2 hours to assemble the PIC complex. After
the incubation, 15 µl of 20× NTP mix [containing
40 mM ATP, 40 mM CTP, 40 mM UTP,
2 mM 3'-O-methylguanosine-5'-triphosphate
(APExBIO)] was added to the 285 µl of PIC
mixture to initiate the transcription reaction.
The reactions were carried out for 50 min at
30°C and then quenched by adding 0.5 µl of
0.4 mM α-amanitin, followed by GraFix (96).
For GraFix, the glycerol gradient was prepared
using a low buffer containing 15% (v/v) glycerol,
30 mM HEPES-KOH pH 7.9, 100 mM KCl,
2 mM MgCl2, 10 µM ZnCl2, and 2 mM DTT and a
high buffer containing 35% (v/v) glycerol, 30 mM
HEPES-KOH pH 7.9, 100 mM KCl, 2 mM MgCl2,
10 µM ZnCl2, 2 mM DTT, and 0.01% glutaraldehyde.
The samples were centrifuged at 38,000 rpm
at 4°C for 14 hours using an SW41 Ti rotor 
(Beckman Coulter). Subsequently, fractions con- 
taining cross-linked complexes were quenched 
with 50 mM Tris pH 7.4 (25°C). The peak 
fractions were assessed by negative-stain EM. 
Fractions of interest were pooled, concentrated, 
and subjected to buffer exchange into a buffer 
containing 30 mM HEPES-KOH pH 7.9, 100 mM
KCl, 2 mM MgCl2, 10 µM ZnCl2, 2 mM DTT, and
0.8% (v/v) glycerol using a 100-kDa cut-off
centrifugation filter unit (Amicon Ultra). For
each assembly step, the samples were monitored
by SDS-PAGE and visualized by Coomassie
blue staining. TC3, TC9, TC10, and TC17 were
assembled and subjected to glycerol gradient
ultracentrifugation without cross-linking, and
peak fractions (as in the cryo-EM sample) of
each complex were subjected to Western blotting
using the indicated antibodies (anti-TAF1
antibody: A303-505A, Bethyl; anti-RPB2 antibody:
A5928, ABclonal; anti-XPB antibody: 10580-1-AP,
Proteintech; anti-TFIIFα antibody: 10093-2-AP,
Proteintech; anti-TBP antibody: A2192, ABclonal;
anti-TFIIEβ antibody: A305-054A, Bethyl; and
anti-TFIIB antibody: A1708, ABclonal).

For cryo-EM grid preparation, 3 µl of the sample
at a concentration of ~0.2 µM was applied
to freshly glow-discharged Quantifoil R1.2/1.3 Au
holey carbon grids. After incubation of 5 s at a
temperature of 4°C and a humidity of 100%, the
grids were blotted for 0.5 to 1.0 s in a Vitrobot
Mark IV (Thermo Fisher Scientific), plunge-frozen
in liquid ethane, and stored at liquid
nitrogen temperature. The grids were prepared
in an H2/O2 mixture for 15 s using a Gatan 950
Solarus plasma cleaning system with a power
of 5 W. The Ø 55/20-mm blotting paper used for
plunge freezing was from Ted Pella.


In vitro transcription initiation assay 


The transcription reaction was conducted by
mixing 0.4 pmol of promoter DNA, 0.5 pmol of
TFIID, 1.5 pmol of TFIIA, 1.5 pmol of TFIIB,
1 pmol of RNA Pol II, 1.5 pmol of TFIIF, 1.5 pmol
of TFIIE, and 0.8 pmol of TFIIH in an 18.8-µl
reaction mixture containing 25 mM HEPES-KOH
pH 7.9, 50 mM KCl, 5 mM MgCl2, 1 mM
DTT, 10% (v/v) glycerol, 200 µg/ml bovine
serum albumin (BSA), and 8 units of RiboLock
RNase Inhibitor (Thermo Fisher Scientific).
The mixture was incubated at 25°C for 30 min.
After the incubation, reactions were initiated
by the addition of 1 µl of 20× NTP mix containing
12 mM ATP, 0.2 mM CTP, 0.02 mM
UTP, 2.4 mM 3'-O-methylguanosine-5'-triphosphate
(APExBIO), and 0.2 µl of 10 mCi/ml [α-³²P]-UTP
(Perkin Elmer). The transcription reactions
were carried out for 50 min at 30°C and
stopped by heating the reaction mixture to
68°C for 3 min. In the time-course assay, the
reactions were stopped at time intervals of
5, 10, 30, and 50 min. Subsequently, after
cooling on ice, the reaction mixtures were
treated with 1 µl of calf intestinal alkaline
phosphatase (CIAP) (20 U/µl, Thermo Fisher
Scientific) at 37°C for 30 min. The RNA products
were analyzed on 23% urea polyacrylamide gels,
and the autoradiograph was obtained by storage-phosphor
scanning (Typhoon; Cytiva).
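
As a rough check on the reaction setup above, the final nucleotide concentrations can be worked out from the stated volumes. The sketch below assumes that the 18.8-µl reaction, the 1 µl of 20× NTP mix, and the 0.2 µl of radiolabel add up to the final volume, and it ignores the small amount of UTP contributed by the radiolabel; the function and variable names are illustrative only.

```python
def final_concentrations(stock_mM, stock_volume_ul, total_volume_ul):
    """Dilute each stock concentration by stock_volume / total_volume."""
    factor = stock_volume_ul / total_volume_ul
    return {name: conc * factor for name, conc in stock_mM.items()}


# Concentrations in the 20x NTP mix as stated in the text (mM).
ntp_mix_20x = {
    "ATP": 12.0,
    "CTP": 0.2,
    "UTP": 0.02,
    "3'-O-methyl-GTP": 2.4,
}

# Assumed final volume: reaction mixture + NTP mix + radiolabel (µl).
total_volume_ul = 18.8 + 1.0 + 0.2

final_mM = final_concentrations(ntp_mix_20x, 1.0, total_volume_ul)
for name, conc in final_mM.items():
    print(f"{name}: {conc:.4g} mM")
```

Under these assumptions the mix is diluted 20-fold, which is consistent with its "20×" label.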


Cryo-EM data collection and image processing 


Detailed parameters of data collection and 
image processing are shown in data S2 and 
S3 and summarized in data S4. A brief description
is provided below, and fig. S2 shows the
data collection and image processing of TC5
and TC10, two representative complexes.

The cryo-EM grids were loaded onto a Titan 
Krios transmission electron microscope (Thermo 
Fisher Scientific) operated at 300 kV for data 
collection. The cryo-EM images were automatically
recorded by a post-GIF Gatan K3 Summit
direct electron detector in the super-resolution
counting mode using SerialEM with a nominal
magnification of 64,000× in the EFTEM mode,
which yielded a super-resolution pixel size of
0.667 Å on the image plane, and with defocus
values ranging from −1.5 to −2.5 µm. Each micrograph
stack was dose-fractionated to 40 frames
with an exposure rate of 21 e⁻ pixel⁻¹ s⁻¹, a
total electron exposure of ~50 e⁻ Å⁻², and a
total exposure time of 4.2 s.
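
The exposure figures quoted above can be cross-checked with a short calculation. This is a minimal sketch that assumes the exposure rate is given per physical detector pixel (1.334 Å, i.e., twice the super-resolution pixel size); that assumption is not stated in the text, but it reproduces the reported total of about 50 electrons per square ångström. The function name is illustrative.

```python
def exposure_summary(rate_e_per_pixel_s, exposure_time_s, pixel_size_angstrom, n_frames):
    """Return (total exposure, exposure per frame) in electrons per square angstrom."""
    electrons_per_pixel = rate_e_per_pixel_s * exposure_time_s
    total = electrons_per_pixel / pixel_size_angstrom ** 2
    return total, total / n_frames


# 21 e-/pixel/s for 4.2 s over 40 frames; 1.334 Å physical pixel is an assumption.
total_dose, dose_per_frame = exposure_summary(21.0, 4.2, 1.334, 40)
print(f"{total_dose:.1f} e-/A^2 total, {dose_per_frame:.2f} e-/A^2 per frame")
# prints "49.6 e-/A^2 total, 1.24 e-/A^2 per frame"
```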

For TC5 and TC10 complexes, 49,009/58,688
(“/” separates parameters for TC5 and
TC10) micrographs were collected for further
processing. Drift and beam-induced motion
correction were applied to the super-resolution
movie stacks using MotionCor2 (97), and the
stacks were binned twofold to a calibrated
pixel size of 1.334 Å pixel⁻¹. The defocus
values were estimated
by Gctf (98) from summed images without
dose weighting. Other procedures of cryo-EM
data processing were performed within RELION
v3.1 (99, 100) and cryoSPARC v3 (101) using
the dose-weighted micrographs. A total of
28,786,688/33,235,621 particles were picked by
reference-based autopicking in RELION using a
high-resolution Pol II map [Electron Microscopy Data Bank
(EMDB) ID 3218] as a 3D reference, a picking
threshold of 0.05, and a minimum interparticle
distance of 180, with other parameters set to default, and
were subjected to reference-free 2D classification
using a mask diameter of 420 Å, 30 iterations,
a Tau2Fudge factor of 2, 200 classes,
an e-step of −1 Å, and ignore CTFs until first peak
(Yes) (other parameters were set to default).
A total of 19,288,625/21,463,872 particles were
selected from good 2D classes for 3D classification
in RELION.

All 3D classification and 3D autorefinement runs
in RELION used an initial low-pass filter of 40 Å,
a Tau2Fudge factor of 4, 160 iterations, 9 classes,
and an e-step of −1 Å (other parameters were
set to default). A total of 12,743,286/12,467,838
particles were selected from 3D classes using a mask
diameter of 420 Å. The resulting particles were
subjected to signal subtraction with a Pol II mask. The
subtracted particles were subjected to 3D classification
using a mask diameter of 280 Å to separate
the ITC and EEC conformations.
The particles of the ITC and EEC were used for
refinement in cryoSPARC Local Refinement,
yielding a reconstruction at 2.98 Å/2.82 Å resolution
for Pol II (including the bound nucleic
acids). All Pol II refinements were
subjected to CTF refinement, postprocessing,
and Bayesian polishing in RELION. The subtracted
particles of Pol II in the ITC and EEC conformations
were reverted to original particles,


Chen et al., Science 382, eadi5120 (2023) 


followed by 3D classification to obtain overall 
maps of ITCs and EECs. The overall maps were 
refined with a box size of 200 pixels and a 
down-sampled pixel size of 2.668 Å.
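
The pixel sizes quoted in this section are related by simple factors of two, which the following sketch makes explicit. It assumes that the 2.668 Å pixel size of the overall maps corresponds to a further twofold down-sampling of the 1.334 Å motion-corrected images; the text gives only the resulting value. It also reports the physical box extent and the Nyquist limit (twice the pixel size), the finest resolution such a map can represent. Function names are illustrative.

```python
def binned_pixel_size(pixel_size_angstrom, binning_factor):
    """Pixel size after binning (down-sampling) by an integer factor."""
    return pixel_size_angstrom * binning_factor


def box_extent(box_size_pixels, pixel_size_angstrom):
    """Physical edge length of a cubic box in angstroms."""
    return box_size_pixels * pixel_size_angstrom


def nyquist_limit(pixel_size_angstrom):
    """Finest resolution representable at a given pixel size."""
    return 2.0 * pixel_size_angstrom


motion_corrected = binned_pixel_size(0.667, 2)        # super-resolution -> 1.334 Å
overall = binned_pixel_size(motion_corrected, 2)      # assumed second twofold step

print(f"motion-corrected pixel size: {motion_corrected:.3f} A")
print(f"overall-map pixel size:      {overall:.3f} A")
print(f"box extent (200 px):         {box_extent(200, overall):.1f} A")
print(f"Nyquist limit:               {nyquist_limit(overall):.3f} A")
```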

All reported resolutions are based on the gold- 
standard Fourier shell correlation (FSC) = 0.143 
criterion. The gold-standard FSC curves were 
corrected for the effects of a soft mask with 
high-resolution noise substitution. We also 
used the 3D FSC server to evaluate our maps. 
All visualization and evaluation of the
3D volume maps were performed using UCSF
Chimera (102), and the local resolution variations
were calculated using cryoSPARC.
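
For readers unfamiliar with the FSC = 0.143 criterion, the following is a minimal NumPy sketch of how a Fourier shell correlation curve between two half maps can be computed and converted to a resolution estimate. It is an illustration under simplifying assumptions (cubic maps, no masking, no high-resolution noise substitution) and is not the implementation used by RELION or cryoSPARC; the function names are hypothetical.

```python
import numpy as np


def fourier_shell_correlation(half_map_1, half_map_2):
    """FSC per integer-radius Fourier shell for two cubic 3D maps."""
    if (half_map_1.shape != half_map_2.shape or half_map_1.ndim != 3
            or len(set(half_map_1.shape)) != 1):
        raise ValueError("expected two cubic 3D maps of identical shape")
    n = half_map_1.shape[0]
    f1 = np.fft.fftn(half_map_1)
    f2 = np.fft.fftn(half_map_2)

    # Integer shell index (radius in Fourier voxels) of every coefficient.
    freq = np.fft.fftfreq(n) * n
    kx, ky, kz = np.meshgrid(freq, freq, freq, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2 + kz**2)).astype(int).ravel()
    n_shells = n // 2
    keep = shell < n_shells

    def shell_sum(values):
        return np.bincount(shell[keep], weights=values.ravel()[keep],
                           minlength=n_shells)

    cross = shell_sum((f1 * np.conj(f2)).real)
    power = np.sqrt(shell_sum(np.abs(f1) ** 2) * shell_sum(np.abs(f2) ** 2))
    # Shells with no power are reported as 0 instead of dividing by zero.
    return np.divide(cross, power, out=np.zeros(n_shells), where=power > 0)


def resolution_at_threshold(fsc, box_size, pixel_size, threshold=0.143):
    """Resolution (Å) at the first shell where the FSC drops below threshold."""
    for shell in range(1, len(fsc)):
        if fsc[shell] < threshold:
            return box_size * pixel_size / shell
    return 2.0 * pixel_size  # never dropped below threshold: Nyquist-limited


# Sanity check on synthetic data: a map correlates perfectly with itself.
rng = np.random.default_rng(0)
demo_map = rng.normal(size=(16, 16, 16))
fsc_same = fourier_shell_correlation(demo_map, demo_map)
```

In practice the two inputs would be independently refined half maps, and the curve would be corrected for masking effects as described above.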


Model building and structure refinement 


The overall cryo-EM maps and locally refined 
maps of all samples were used for model build- 
ing. The structures of the mammalian PIC 
complex [Protein Data Bank (PDB) ID 7EGB] 
(15) and mammalian elongation complex (PDB 
ID 5FLM) (1) were used as initial structural
templates. For ITC complexes, structural mod- 
ules of Pol II and GTFs were individually docked 
into the cryo-EM maps by rigid-body fitting 
using UCSF Chimera. For EEC complexes, Pol II 
and TFIIF were docked into the cryo-EM maps, 
whereas other GTFs were not built because the 
density did not support confident model dock- 
ing. The structural models were manually 
adjusted in COOT (103). 

Overfitting of the model was monitored by 
refining the model in one of the two half maps 
from the gold-standard refinement approach 
and testing the refined model against the other 
map. The structural models (except the flexibly
docked TFIIE, TFIIH, and TFIID) were
evaluated using Phenix real-space refine and
MolProbity (104) using resolution at 6.5 Å for
the ITC and 3.5 Å for the EEC and constraints
of rotamer, Ramachandran, and secondary
structure. Map and model representations in
the figures and movies were prepared using PyMOL
(105), UCSF Chimera, or UCSF ChimeraX (102).
INTRODUCTION: Heterotrimeric guanine nucleotide-
binding protein (G protein)-coupled receptors 
(GPCRs) bind extracellular ligands to modulate 
intracellular signaling responses. Two fundamen- 
tal properties of ligand-receptor-signaling sys- 
tems are efficacy (the maximum achievable 
response) and potency (the ligand concentration 
required to produce the half-maximal response). 
Although efficacy and potency have been mea- 
sured for numerous ligand-receptor-signaling 
systems for several decades, the molecular de- 
terminants and principles governing these phar- 
macological properties remain a mystery. 


RATIONALE: Understanding how each receptor 
residue contributes to efficacy and potency can 
aid in designing drugs to elicit desired signaling
responses. Using the adrenaline-β2 adrenergic
receptor (β2AR)-Gαs system, we perturbed
the side chain of each of the receptor’s 412 
residues and determined the impact on effi- 
cacy and potency. By developing a data science 
framework that integrates pharmacological 
and structural data, we contextualized ligand- 
induced structural changes and revealed the 
principles of efficacy and potency. 




RESULTS: Only 20% of the β2AR residues contribute
to the receptor’s pharmacological properties.
One-third of those pharmacologically
relevant residues map to ligand- or G protein- 
binding sites or to evolutionarily conserved motifs; 
the other two-thirds are distributed through- 
out the receptor. All ligand-binding residues 
are important for signaling but show differ- 
ential contributions to efficacy and potency, 
which indicates that specific contacts between 
ligand and receptor could be modulated to fine- 
tune pharmacological properties. By contrast, 
only one-third of the receptor residues that con- 
tact the G protein contribute to pharmacology, 
which indicates that these positions tolerate 
mutations and provides an explanation for 
the evolvability of G protein selectivity. 

We integrated structural data about active 
and inactive receptor conformations with phar- 
macological measurements. Not all residues 
that undergo structural change are pharma- 
cologically important, and several pharmaco- 
logically important residues do not undergo 
structural change upon activation. We catego- 
rized all receptor residues into four classes: 
Driver and passenger residues mediate active
state-specific contacts, and, whereas drivers
affect pharmacology when mutated, passengers 
do not. Modulator and bystander residues do 
not mediate active state-specific contacts, but 
modulators affect pharmacology when mutated, 
whereas bystanders do not. 
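
The four residue classes follow from two yes/no properties, which can be written as a small decision rule. The sketch below is only an illustration of the definitions given above; the function and argument names are hypothetical, and it does not reproduce the criteria used to decide whether a residue mediates an active state-specific contact or affects pharmacology.

```python
def classify_residue(mediates_active_state_contact: bool,
                     affects_pharmacology: bool) -> str:
    """Assign a residue to one of the four classes defined in the text."""
    if mediates_active_state_contact:
        return "driver" if affects_pharmacology else "passenger"
    return "modulator" if affects_pharmacology else "bystander"


# All four combinations of the two properties.
for contact in (True, False):
    for pharmacology in (True, False):
        label = classify_residue(contact, pharmacology)
        print(f"contact={contact!s:5} pharmacology={pharmacology!s:5} -> {label}")
```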

We uncovered an allosteric network of active 
state-specific contacts mediated by driver resi- 
dues from the ligand-binding pocket to the G 
protein-binding interface, thereby identifying 
structural changes that are pharmacologically 
relevant. Modulator residues are located near the 
allosteric network and functional sites. Surface- 
exposed driver, modulator, and passenger resi- 
dues represent key allosteric sites and potential 
targets for the development of new allosteric 
ligands. Analysis of human polymorphisms and 
the conservation of residues across species re- 
vealed that passenger, modulator, and driver 
residues are under increasing selection pressure 
compared with bystander residues in this receptor. 


CONCLUSION: Our work reveals how a GPCR 
decodes and translates the information encoded 
in a ligand to mediate a distinct signaling re- 
sponse. We anticipate that the application of 
this data science framework will enable the 
design of orthosteric and allosteric molecules 
that can elicit defined signaling responses. 
Heterotrimeric guanine nucleotide-binding protein (G protein)-coupled receptors (GPCRs) bind to
extracellular ligands and drugs and modulate intracellular responses through conformational
changes. Despite their importance as drug targets, the molecular origins of pharmacological properties
such as efficacy (maximum signaling response) and potency (the ligand concentration at half-maximal
response) remain poorly understood for any ligand-receptor-signaling system. We used the
prototypical adrenaline-β2 adrenergic receptor-G protein system to reveal how specific receptor
residues decode and translate the information encoded in a ligand to mediate a signaling response. We 
present a data science framework to integrate pharmacological and structural data to uncover structural 
changes and allosteric networks relevant for ligand pharmacology. These methods can be tailored to 
study any ligand-receptor-signaling system, and the principles open possibilities for designing 
orthosteric and allosteric compounds with defined signaling properties. 


Heterotrimeric guanine nucleotide-binding
protein (G protein)-coupled receptors
(GPCRs) constitute a major family of
membrane proteins that respond to diverse
extracellular ligands, including
photons, small molecules, neurotransmitters,
and hormones (1-5). In humans, more than
500 endogenous ligands (6) act on 800 GPCRs 
and regulate key aspects of human biology (7, 8). 
As a result of their capacity to modulate hu- 
man physiology, GPCRs are targets of about 
one-third of all US Food and Drug Adminis- 
tration (FDA)-approved drugs for various dis- 
eases (9). Ligand binding to a GPCR induces 
conformational changes that enable modulation
of downstream signaling responses (10).
During that process, the ligand shifts the conformational
equilibrium of the receptor toward
a more active or inactive state, depending on
whether it is an agonist, antagonist, or inverse
agonist (11, 12). The conformational states associated
with agonist binding strongly increase
the likelihood of a signaling response
(e.g., G protein activation). This agonist capacity
to promote signaling has been classically
quantified by two fundamental pharmacological
parameters: efficacy and potency (13), which
can be determined by monitoring the receptor 
signaling response at various ligand concen- 
trations (concentration-response curves). Effi- 
cacy relates to the maximum amplitude of a 
signaling response, whereas potency refers to 
the agonist concentration at which signaling 
reaches the half-maximal response. Endoge- 
nous agonist efficacy and potency are within 
a physiologically relevant range as a result of 
evolutionary selection pressure on the tertiary 
structure of GPCRs. Thus, understanding how the 
ligand interacts with the receptor to modulate 
the signaling response is paramount to ratio- 
nalizing the link between endogenous ligand 
properties and receptor sequence and struc- 
ture, which can aid in designing drugs that 
engage specific receptor residues to elicit dis- 
tinct signaling responses in a disease context. 
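
To make the two parameters concrete, the sketch below generates a concentration-response curve from a standard sigmoidal (Hill-type) model and reads off efficacy as the response amplitude and potency as the log concentration at the half-maximal response. The model choice, the parameter values, and the function names are assumptions made for illustration; they are not taken from the data or the fitting procedure of this study.

```python
def hill_response(log_conc, bottom, top, log_ec50, hill_slope=1.0):
    """Sigmoidal concentration-response model (concentrations as log10 molar)."""
    return bottom + (top - bottom) / (1.0 + 10.0 ** ((log_ec50 - log_conc) * hill_slope))


def summarize_curve(log_concs, responses):
    """Estimate efficacy (amplitude) and potency (logEC50) from sampled points.

    Assumes responses increase with concentration and that both plateaus
    are reached within the sampled range.
    """
    efficacy = max(responses) - min(responses)
    half = min(responses) + efficacy / 2.0
    points = list(zip(log_concs, responses))
    # Linear interpolation between the two points bracketing the half-maximal response.
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 <= half <= y1:
            return efficacy, x0 + (half - y0) * (x1 - x0) / (y1 - y0)
    raise ValueError("half-maximal response is not bracketed by the data")


# Hypothetical 12-point curve from 1e-10 M to 1e-4 M with logEC50 = -7.
log_concs = [-10.0 + 6.0 * i / 11 for i in range(12)]
responses = [hill_response(x, 0.0, 100.0, -7.0) for x in log_concs]
efficacy, log_ec50 = summarize_curve(log_concs, responses)
print(f"efficacy ~ {efficacy:.1f} (arbitrary units), logEC50 ~ {log_ec50:.2f}")
```

A mutation that lowers the plateau reduces efficacy, whereas one that shifts the curve to higher concentrations reduces potency; the two effects can occur independently.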

Numerous structural studies have revealed 
the importance of the movement of specific 
transmembrane helices as a key feature that is 
associated with receptor activation upon ligand
binding (14-16). Furthermore, changes in
the size of the ligand-binding cavity have been 
associated with high-affinity ligand binding 
and the strength of the signaling response (17). 
Extensive mutagenesis studies have identified 
residues along the receptor sequence that are 
important for modulating downstream signaling
activity (18, 19). Despite the thorough
pharmacological, structural, and mutational 
characterization of numerous ligand-GPCR 
pairs over the years, we do not yet fully under- 
stand how specific residues of the receptor 
decode and translate the information that is 
encoded in the ligand atoms to mediate a distinct
intracellular signaling response (i.e., the
molecular origins of efficacy and potency). For
instance, in the well-studied adrenaline-β2
adrenergic receptor (β2AR)-Gs signaling system,
it is unclear whether all residues that
contact the ligand or the G protein (fig. S1) 
are important for adrenaline’s efficacy and 
potency toward Gs and whether all residues 
that are involved in structural changes during 
activation are important for signaling (fig. S1). 
If not, it would be helpful to identify the resi- 
dues that help convert ligand binding into a 
signaling response and how they contribute to 
the efficacy and potency response of the ligand 
at the receptor. 

To systematically infer how ligand atoms 
are decoded by the receptor sequence and 
structure to modulate downstream signaling 
response, one needs to perturb the side chain 
of every receptor residue, reliably measure the 
impact on efficacy and potency for a signaling 
response, and contextualize the structural 
changes of the receptor residue with its pharmacological
importance. We used β2AR, a well-
studied, prototypical family A GPCR that 
mediates the fight-or-flight response when 
activated by its endogenous agonist adren- 
aline (epinephrine) and signals through the G 
protein Gs. By developing a data science 
framework that integrates (i) experimentally 
measured values of adrenaline-stimulated 
Gs signaling upon systematically perturbing 
the side chain of every residue in β2AR and (ii)
data on structural changes associated with 
agonist binding, receptor activation, and Gs 
coupling (Fig. 1), we reveal the molecular ori- 
gins and principles that govern ligand efficacy 
and potency in this prototypical GPCR signal- 
ing system. 


BRET-based biosensors allow in-depth 
pharmacological profiling of receptor mutants 


We perturbed the side chain of each of the 
412 residues in the receptor by mutating the 
residue to alanine, or glycine if the native 
amino acid was alanine, because both types 
of substitutions are well tolerated in the long, 
membrane-spanning α helices (20) of a GPCR.
For each mutant, we then evaluated its 
adrenaline-stimulated Gs signaling profile 
in a live-cell assay with a bioluminescence 
resonance energy transfer (BRET)-based biosensor
(21, 22) (fig. S2 and Materials and methods).
The biosensor reports on the distance
between the Gα and Gγ subunits of the heterotrimeric
G protein; this distance increases
upon receptor activation as a result of conformational
changes in the α subunit and its
dissociation from the βγ subunits, leading to
a decrease of resonance energy transfer between
the donor and acceptor that are fused
to the α and γ subunits, respectively. Each assay
was performed as 12-point concentration-response
curves in biological triplicates (Fig. 1,
top panel, and Materials and methods) to quantify
the signal amplitude (efficacy) and the log
half-maximal effective concentration (logEC50)
(potency) of the adrenaline-stimulated G protein
activation (fig. S2 and Materials and methods).
Because a mutation may affect receptor
abundance and low cell-surface abundance
could negatively influence the signaling, we
also evaluated the abundance of all mutants
by cell-surface enzyme-linked immunosorbent
assay (ELISA) (fig. S3, A to C, and Materials
and methods). We performed measurements
on cells expressing wild-type β2AR or no receptor
as controls. In total, we obtained more
than 16,000 signaling data points; the pharmacological
properties for agonist-induced Gs
activation for each mutant are reported in table
S1. For subsequent analyses of these data, we
used the GPCRdb numbering scheme (23), a
system based on Ballesteros-Weinstein numbering
(24), wherein the first number in the superscript
refers to the helix or loop, and the second
number refers to the position relative to the
most conserved residue (numbered 50) in that
helix [e.g., L124³ˣ⁴³, which is 7 residues before
the most conserved residue in transmembrane
helix 3 (TM3)].

Fig. 1. An approach to integrate pharmacological and structural data to reveal the molecular determinants
of efficacy and potency. (A) Every residue of a receptor is mutated, and key pharmacological properties
of the adrenaline-β2AR-Gs signaling system are determined experimentally. (B and C) Active- and inactive-state
structures of receptors are analyzed as networks of noncovalent contacts between residues to infer contacts
that are specific for the active and/or inactive state. (D) These data are integrated using a data science approach
to discover the molecular determinants and the underlying allosteric network governing efficacy and potency.


One-fifth of the positions in β2AR are important
for adrenaline efficacy, potency, or both

For 16 receptor positions, side chain perturbation
severely affected cell-surface abundance
(<25% of wild-type level; Materials and methods).
These mutants were excluded from further
analysis because we could not reliably
determine whether the observed reduction in
signaling resulted from altered function, reduced
abundance, or both (fig. S3, A to C).
Many of these positions are highly conserved 
and are likely important for protein folding, 
receptor biogenesis, and transport to the cell 
membrane (fig. S3, D to H). The remaining
396 mutants had cell-surface abundance that
allowed reliable estimation of pharmacological
parameters and were therefore considered
for further analysis (abundance >25% of the
wild-type β2AR; Materials and methods). Most
of the single-point mutations (~80%; at 314
positions) did not negatively affect Gs signaling,
and ~20% (at 82 positions) had an impact
on signaling. Of these 82, 21 primarily reduced
efficacy, 37 primarily reduced potency, 21 reduced
both potency and efficacy, and 3 resulted
in no measurable signal (Fig. 2, A and B).
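
The counts reported in this paragraph are internally consistent, as the following bookkeeping sketch shows. All numbers are taken from the text; the variable names are illustrative.

```python
total_residues = 412          # residues mutated
low_abundance = 16            # excluded: <25% of wild-type surface abundance

# Mutants with an effect on Gs signaling, by category.
effects = {
    "efficacy only": 21,
    "potency only": 37,
    "efficacy and potency": 21,
    "no measurable signal": 3,
}

analyzed = total_residues - low_abundance
affected = sum(effects.values())
unaffected = analyzed - affected

print(f"analyzed:   {analyzed}")
print(f"affected:   {affected} ({100 * affected / analyzed:.1f}%)")
print(f"unaffected: {unaffected} ({100 * unaffected / analyzed:.1f}%)")
```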

Mapping these positions on the secondary 
structure elements of the receptor showed 
that TM5 was enriched in mutations that af- 
fected potency only, whereas TM3 was en- 
riched in mutations that affected efficacy only. 
The mutations that affected efficacy and po- 
tency simultaneously were distributed across 
the different transmembrane helices (fig. S4, 
A to D). Although some of the identified po- 
sitions overlap with functional residues in the 
ligand-binding pocket and the G protein- 
binding interface, several positions mapped 
to other parts of the receptor, including resi- 
dues on the receptor surface, with the highest 
Fig. 2. Receptor positions that affect efficacy or
potency upon mutation. (A) Overview of potency
(normalized logEC50) and efficacy (normalized
signal amplitude) of all β2AR mutations. Cutoffs
are shown as dashed lines (Materials and methods).
(B) Distribution of residues by functional
relevance and their effect on efficacy and potency
(Materials and methods). Ligand-binding pocket
and G protein-binding site include residues within 4 Å
of adrenaline (PDB ID: 4LDO) or Gs (PDB ID: 3SN6),
respectively. (C) Accessible surface area of residues,
grouped by whether a mutation affects signaling
(effect) or not (no effect). For residues where signaling
was affected by mutation, the location of mutations in
the ligand-binding site, in the core, in the G protein-binding
site, or in none of the above (other) is
density mapping to the receptor core (Fig. 2,
B and C). These findings collectively indicate 
that the efficacy and potency of this endoge- 
nous agonist at this receptor are likely gov- 
erned by a subset of receptor positions (~20%) 
that are not just restricted to known func- 
tional sites but are also distributed in differ- 
ent parts of the structure. These receptor 
positions thus decode and translate ligand bind- 
ing into a signaling response and are, hence, 
pharmacological determinants for this ligand- 
receptor-signaling system. 
Receptor residue-ligand atom contacts
contribute differentially to efficacy
and potency

We investigated how each ligand-contacting 
residue contributes to efficacy and potency. 
For this, we first identified all receptor residues
that contact adrenaline using a distance
cutoff [Protein Data Bank (PDB) ID: 4LDO
(25)] (Fig. 3A and Materials and methods) and
the ligand atoms that they contact by constructing
a receptor residue-ligand atom contact
matrix (Fig. 3B). Although mutants of all 10 of
the 11 adrenaline-contacting residues that were
expressed well negatively affected the signaling
response, they had distinct effects on
efficacy and potency (Y316A markedly decreased
receptor abundance; fig. S3). Mutations
in around half of the positions (6 of 10)
are indicated in violet. The following are not
labeled for clarity: V114, S203, and F289. (B) Receptor residue-ligand atom contact plot with the ligand
atoms on the x axis and the receptor’s GPCRdb and residue numbers on the y axis. The number of noncovalent
contacts between a receptor residue and ligand atom is shown in each square of the heatmap. The chemical
structure of adrenaline below the heatmap indicates the labeling of adrenaline atoms used for the x axis. Box
colors in the heatmap refer to the pharmacological effect of the mutation [efficacy affected (light blue), potency
affected (ocean blue), both efficacy and potency affected (dark blue), or no measurable signaling (black)].
The number of receptor residues contacted by each ligand atom and the number of ligand atoms contacted
by each receptor residue are indicated in boxes at the top and right-hand side of the heatmap, respectively.
(C) Surface view of the ligand-binding site, where the surface of each receptor residue is colored by the effect that 
affected potency only, one affected efficacy
only, two affected both efficacy and potency
when mutated, and D113³ˣ³²A completely abolished
signaling. D113³ˣ³² is critical for agonist
binding in β2AR (26, 27) and other aminergic
receptors because it contacts the positively 
charged amine group, which is a common fea- 
ture of endogenous agonists for monoamine 
receptors. When this information was map- 
ped onto the ligand-binding pocket (Fig. 3C), 
it was obvious that the ligand-contacting resi- 
dues that affect potency only upon mutation 
group together on one side of the receptor 
involving TM3, TM5, and TM6, whereas those 
that affect both efficacy and potency upon 
mutation group together on the other side of 
the receptor, largely involving TM6 and TM7. 
The ligand-contacting residue that affected 
efficacy only is in TM3 and is deeply buried 
in the binding pocket. These results indicate 
that each receptor residue that contacts the 
ligand contributes differentially to the overall 
pharmacological response. Of the 159 Å² of receptor
surface area occupied by the ligand,
20% (31.5 Å²) appears to be important for both
efficacy and potency, 6% (9 Å²) for efficacy
only, and 59% (93 Å²) for potency only. Thus,
the pharmacological parameters of a ligand at 
a receptor seem to emerge from the differ- 
ential contribution of each receptor residue 
to the overall efficacy and potency in response 
to a ligand. 

From the ligand perspective, the distance- 
based contact map shows that some adrena- 
line atoms are in close proximity to residues 
that only affect potency when mutated (the 
oxygen atoms in the hydroxyl groups and the 
aromatic carbon C3; Fig. 3B). The map also 
shows that the aromatic atoms C1, C2, C5,
and C6 are in close proximity to residues that
affect potency only, efficacy only, or both when
mutated. The remaining ligand atoms make
extensive contact with two residues that affect
both efficacy and potency when mutated, including
N312 and the key D113³ˣ³² (O3,
C7, C8, N1, and C9). These findings reveal the
existence of spatially clustered groups of ad- 
renaline atoms that mediate contact with spe- 
cific receptor residues to preferentially affect 
efficacy and/or potency responses in recep- 
tor signaling. An implication of this obser- 
vation is that chemical modifications of these 
atoms that can mediate contacts with specific 
receptor residues might allow a more targeted 
ligand modification to alter the efficacy or po- 
tency of Gs activation. This can be seen from 
the receptor residue-ligand atom contact matrix
for the β2-adrenergic ligand salmeterol (fig.
S5). Together, these results indicate that, al- 
though all ligand-contacting residues are im- 
portant for receptor signaling, distinct receptor 
residues and specific chemical groups on the 
ligand contribute differentially to efficacy and 
potency responses. Thus, although efficacy 
and potency are measured as a general ligand
property at a receptor, they arise from a collection 
of receptor residue-ligand atom noncovalent 
contacts, each of which contributes differen- 
tially to pharmacological properties in down- 
stream signaling response. 


Only one-third of the G protein-contacting 
residues are important for efficacy and potency 


From the structures of the β2AR-Gs complex
(PDB IDs: 3SN6 and 6E67), we observed that 
27 receptor residues make 49 noncovalent con- 
tacts with 24 G protein residues (Fig. 3, D to F, 
and Materials and methods). Only one-third 
(9 of 27) of the receptor interface residues 
showed an effect on G protein activation when 
mutated. This is in stark contrast to the ligand- 
binding residues, where all positions were 
important for receptor signaling (Fig. 3B). Of 
the positions in the G protein-binding site, 
four affected potency only, two affected effi- 
cacy only, and three affected both when mu- 
tated. From the G protein perspective, key 
conserved residues that undergo conforma- 
tional changes during receptor binding [e.g., 
H5.12,15,16,19,20,25, S3.1, and S1.2 in the com- 
mon G protein numbering (CGN) scheme (28)] 
mediate extensive contacts with receptor resi- 
dues that affected efficacy, potency, or both 
when mutated (28, 29). Collectively, these re- 
sults reveal that there is a subset of receptor- 
interface residues that translate ligand potency 
and efficacy into G protein activation at the 
receptor-G protein interface. A large fraction 
of receptor residues that contact the G protein 
can be mutated to alanine with little effect on 
efficacy or potency and are thus evolvable. The 
latter findings imply that upon gene dupli- 
cation and divergence, receptors can tolerate 
certain types of mutations at the G protein- 
binding interface without losing their capacity 
for signal transduction. Such a structure-to- 
function architecture may provide an addi- 
tional explanation as to why GPCRs could 
evolve by gene duplication and acquire new 
G protein selectivity (30, 31).


Conserved motifs only explain a fraction of 
residues important for signaling 


Of all the positions that had an influence on 
ligand efficacy and potency, only a fraction 
(19 of 82 residues; 24%) were in the ligand- 
binding and G protein-binding sites. A de- 
tailed analysis of the key motif residues of 
GPCRs, as well as the most highly conserved 
positions (x50 positions in each helix) (Mate- 
rials and methods), accounted for an addi- 
tional eight residues (10%) (fig. S6). Key 
sequence and structural motifs with known 
functional roles include CWxP, PIF, NPxxY, 
and DRY (where each uppercase letter is an 
amino acid residue, and each “x” represents 
any amino acid). However, two-thirds of the 
pharmacologically important residues (55 of 
82 residues) do not map to known functional
sites or motifs and are hence not readily inter- 
pretable. These positions most likely represent 
receptor- or receptor family-specific switches 
or allosteric conduits that contribute to trans- 
lating ligand binding into signaling. 


Structural changes alone do not specify the 
pharmacological importance of a residue 


Because key structural changes have been ex- 
tensively described as a hallmark of receptor 
activation (14, 16, 32, 33), we investigated how 
those structural changes are associated with 
pharmacologically important residues. We cal- 
culated how much each residue moved during 
activation by computing the distance between 
the α-carbon atoms of each residue in the inactive
(highest-resolution structure, 2.4 Å;
PDB ID: 2RH1) and active, G protein-bound
state (highest-resolution structure, 3.2 Å; PDB
ID: 3SN6). The distribution of the distance
moved by each residue is skewed to the right:
in ~8% of cases, the α-carbon atom
moved more than 4 Å (22 residues; fig. S1B). Of
these, only seven were pharmacologically im- 
portant and affected efficacy or potency when 
mutated, whereas the rest had no impact on 
downstream signaling (Fig. 4, A and B). Fur- 
thermore, comparing the set of residues that 
were pharmacologically important with those 
that were not revealed that an approximately 
equal proportion between the two sets ap- 
peared to undergo a structural change (i.e., 
move >4 Å). Finally, an analysis of dihedral
angles in the active and inactive states also 
revealed no association between the extent of 
change in the dihedral angle and pharmacological
importance of a residue (fig. S7). These
findings collectively suggest that it is not pos- 
sible to infer the pharmacological importance 
of a residue just from the extent of structural 
change (translation or rotation) during recep- 
tor activation. 
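
The per-residue displacement described above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the inactive and active structures have already been superposed in a common reference frame and that residues are matched by number. The function names and the toy coordinates are hypothetical (they are not taken from 2RH1 or 3SN6); only the 4 Å cutoff comes from the text.

```python
import math


def ca_displacement(inactive_ca, active_ca):
    """Distance (angstroms) between matched C-alpha atoms in two states.

    Both arguments map residue number -> (x, y, z). Only residues present
    in both structures are compared.
    """
    shared = sorted(set(inactive_ca) & set(active_ca))
    return {res: math.dist(inactive_ca[res], active_ca[res]) for res in shared}


def moved_residues(displacements, threshold=4.0):
    """Residues whose C-alpha atom moved more than `threshold` angstroms."""
    return sorted(res for res, d in displacements.items() if d > threshold)


# Toy coordinates (hypothetical values for illustration only).
inactive = {100: (0.0, 0.0, 0.0), 101: (3.8, 0.0, 0.0), 102: (7.6, 0.0, 0.0)}
active = {100: (0.0, 1.0, 0.0), 101: (3.8, 3.0, 4.0), 102: (7.6, 0.0, 0.5)}

displacements = ca_displacement(inactive, active)
large_movers = moved_residues(displacements)
```

Crossing the resulting list with the set of pharmacologically important residues gives the kind of comparison reported above, in which a large displacement by itself did not indicate pharmacological importance.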

Because the formation or breaking of non- 
covalent contacts during activation comprises 
both translational (distance) and rotational 

Fig. 4. Structural changes and pharmacological importance of residues. (A) Distance between Cα atoms
in the inactive state (PDB ID: 2RH1) versus the active, G protein-bound state (PDB ID: 3SN6) plotted against
normalized Gs logEC50 of the alanine or glycine mutant in that position. White and gray areas indicate residues with
potency affected and not affected, respectively, upon mutation. (B) Distance between Cα atoms plotted against
normalized Gs signal amplitude. White and gray areas indicate residues with efficacy affected and not affected,
respectively, upon mutation. (C) Active state-specific contacts (Materials and methods) are shown on the active, G
protein-bound receptor structure. Potency and efficacy values for all mutants are plotted (except those with low
abundance and those for which no signaling was detected). Mutants are colored green if the mutated residue is
involved in an active state-specific contact and black if it is not. White areas indicate residues with efficacy, potency,
or both affected, and the gray area indicates those that were not affected upon mutation. Numbers in the plots
denote the number of residues in each category (black or green). (D) Inactive state-specific contacts on the
inactive-state structure. A mutant (data point in the plot) is colored orange if the residue is involved in an inactive
state-specific contact and black if it is not. White and gray areas of the graph are defined as in (C).

(dihedral angle) changes to different extents,
we focused on residues that make active state- 
specific contacts or break inactive state-specific 
contacts to assess whether they can inform 
pharmacological importance (Fig. 4, C and 
D; Materials and methods; and table S82). 
Considering those two categories, the for- 
mation of an active state-specific contact by 
a residue showed a higher association with 
pharmacological importance. For instance, 
~50% (41 of 80 residues with structural data) 
of residues that are pharmacologically impor- 
tant participated in an active state-specific 
contact, whereas only 20% (35 of 178) of the 


pharmacologically unimportant residues did. 
This effect was not observed for inactive state- 
specific contacts, for which ~50% (39 of 80) of 
pharmacologically important residues with 
structural data and 47% (83 of 178) of un- 
important residues participated in an inactive 
state-specific contact. These findings indicate 
that systematic integration of mutational ef- 
fects and structural analyses considering ac- 
tive state-specific residue contacts may help 
identify and characterize the subset of pharma- 
cologically important residues that undergo a 
structural change in this ligand-receptor sys- 
tem. Notably, it can explain which of the struc- 


tural changes that happen during activation 
are relevant for ligand efficacy and potency at 
the receptor. 
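
The contrast between active and inactive state-specific contacts can be recomputed from the counts quoted above. The sketch below derives the four fractions and, purely as an illustration, adds a one-sided hypergeometric enrichment test. Applying that test to this particular comparison is an assumption made here and is not stated as the authors' analysis; the helper name `hypergeom_upper_tail` is hypothetical.

```python
from math import comb


def hypergeom_upper_tail(k, population, successes, draws):
    """P(X >= k) for X ~ Hypergeometric(population, successes, draws)."""
    total = comb(population, draws)
    upper = min(successes, draws)
    favorable = sum(
        comb(successes, i) * comb(population - successes, draws - i)
        for i in range(k, upper + 1)
    )
    return favorable / total


# Counts quoted in the text (residues with structural data).
important, unimportant = 80, 178
active_important, active_unimportant = 41, 35
inactive_important, inactive_unimportant = 39, 83

fractions = {
    "active, important": active_important / important,
    "active, unimportant": active_unimportant / unimportant,
    "inactive, important": inactive_important / important,
    "inactive, unimportant": inactive_unimportant / unimportant,
}

population = important + unimportant
# Enrichment of contact-forming residues among the important residues.
p_active = hypergeom_upper_tail(
    active_important, population, active_important + active_unimportant, important
)
p_inactive = hypergeom_upper_tail(
    inactive_important, population, inactive_important + inactive_unimportant, important
)
```

Under these assumptions, `p_active` is far smaller than `p_inactive`, which matches the qualitative statement that only active state-specific contacts are associated with pharmacological importance.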


Linking structural changes with pharmacological 
importance reveals new roles for residues 


To identify the structural changes relevant for 
efficacy and potency, we devised an approach 
to integrate the experimentally determined 
pharmacological parameters with the active 
state-specific contacts (Fig. 5A and Materials
and methods). We first classified every residue 
as pharmacologically important (if it affects 
efficacy, potency, or both when mutated) or 

Fig. 5. The allosteric contact network for efficacy and potency as determined
by structure and pharmacology. (A) Classification of residues according to their
pharmacological importance (whether a mutation affected signaling or not) and
structural relevance (whether a residue forms an active state-specific contact or
not) into four main classes: bystanders (white), passengers (wheat), modulators
(slate), and drivers (green), the latter of which is subdivided into connected
drivers (driver residues connected to other driver residues) and unconnected driver
residues (Materials and methods). (B) Position of residues classified by their
participation in active state-specific contacts and their importance for pharmacology
on a two-dimensional (2D) top-view projection of the active, G protein-bound
structure of the β2AR (PDB ID: 3SN6). Frequency of residues along the x and y axes
of the receptor (plane parallel to the membrane) is shown on the distributions
outside. Arrows denote the direction of the peaks of the distribution for the residue
classes. Driver residues tend to be in the center, whereas bystander residues tend
to be at the periphery. (C) Integration of pharmacologically important residues
and the residues contributing to newly established contacts in the active-state
structure yields the allosteric network for efficacy and potency (see main text).
(D) Residues forming the allosteric networks for efficacy and potency are
represented as a cartoon, colored by the effect of the mutation. The graph (left)
depicts the frequency of potency versus efficacy effects for residues in the network,
projected on the axis running along the extra- to intracellular side. Small circles
and their sizes (scaled to the relative strength of effect) within each residue indicate
the magnitude and effect on potency and efficacy. Edge thickness in the network
corresponds to the number of contacts, ranging from 1 to 5.

not pharmacologically important (if it has
no effect on either efficacy or potency). We 
then stratified according to whether a resi- 
due forms an active state-specific contact or 
not (i.e., whether it is structurally important 
or not). On the basis of these two properties, 
we defined four classes of residues: (i) driver 
residues that mediate an active state-specific 
contact and affect pharmacology (41 residues), 
(ii) modulator residues that do not form an
active state-specific contact but are important 
for pharmacology (41 residues), (iii) passenger 
residues that mediate an active state-specific 
contact but are not important for pharmacol- 
ogy (35 residues), and (iv) bystander residues 
that neither mediate active state-specific con- 
tacts nor affect pharmacology when mutated 
(278 residues) (Materials and methods and 
Fig. 5A). Thus, the pharmacologically impor- 
tant residues are either drivers or modulators 
(82 mutations that affect pharmacology), and 
the structurally important residues are either 
drivers or passengers (76 residues that medi- 
ate an active state-specific contact). As a gen- 
eral pattern, driver residues tend to be close 
to the central axis going through the receptor 
and perpendicular to the membrane, whereas 
modulator and passenger residues are located 
further away from this axis, in proximity with 
solvent or membrane-exposed positions (Fig. 
5B). These observations prompted us to inves- 
tigate the existence of a network of pharma- 
cologically relevant active state-specific residue 
contacts mediated by driver residues and fur- 
ther understand the role of modulator resi- 
dues for signal transduction. 
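
The two-property classification above reduces to a small decision table. The sketch below is one way to express it; the function name and the toy residue labels are hypothetical and carry no data from the study.

```python
from collections import Counter


def classify_residue(affects_pharmacology, forms_active_contact):
    """Assign one of the four classes from the two boolean properties."""
    if affects_pharmacology and forms_active_contact:
        return "driver"
    if affects_pharmacology:
        return "modulator"
    if forms_active_contact:
        return "passenger"
    return "bystander"


# Hypothetical toy input:
# label -> (affects pharmacology, forms an active state-specific contact)
toy_residues = {
    "res_a": (True, True),
    "res_b": (True, False),
    "res_c": (False, True),
    "res_d": (False, False),
    "res_e": (True, True),
}

classes = {name: classify_residue(*flags) for name, flags in toy_residues.items()}
class_counts = Counter(classes.values())
```

With the counts given in the text, drivers plus modulators account for the 41 + 41 = 82 pharmacologically important residues, and drivers plus passengers for the 41 + 35 = 76 residues that mediate an active state-specific contact.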


A subset of residues driving efficacy and 
potency form an allosteric contact network 


Of the 41 driver residues, 23 form 15 active 
state-specific noncovalent contacts to other 
driver residues with a shared pharmacological 
effect (connected drivers) (Materials and meth- 
ods). This allosteric network of active state- 
specific contacts mediated by driver residues 
allowed us to describe how potency and effi- 
cacy signals are transmitted across the re- 
ceptor structure at the residue level through 
coordinated structural rearrangements (Fig. 
5C and fig. S8). The network involving con- 
nected drivers started extracellularly in TM5 
and TM6 and formed several contacts between 
TM5 and TM6, including with F282 of the 
PIF motif (34, 35) (Fig. 5D), before reaching 
TM3 in the receptor core. TM3 further connected
to TM7 and the most conserved residue
in TM2, D79²ˣ⁵⁰ (26), which is part of the allosteric
sodium-binding motif (36) (Fig. 5D).
Finally, the network reached residues in the 
intracellular region, including D130³ˣ⁴⁹ and
Y326⁷ˣ⁵³ of the conserved DRY and NPxxY
motifs in TM3 and TM7, respectively (Fig. 5D). 
Potency-affecting driver residues in the net- 
work were enriched at the extracellular side of 
the receptor involving TM5 and TM6 (total of
9 residues; Fig. 5D), and efficacy-affecting driver 
residues were enriched at the intracellular side 
of the receptor in TM7, TM2, and TM3 (total of 
5 residues; Fig. 5D). Driver residues in the 
allosteric network that affected both efficacy 
and potency were enriched in the center of 
the receptor, connecting to potency-only and 
efficacy-only residues (9 residues; Fig. 5D). 
This raises the question as to how ligand bind- 
ing to the receptor initiates and transmits 
information through the allosteric network 
to modulate downstream signaling. 
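
The notion of connected drivers can be illustrated as a filter over a contact list. In the sketch below, an edge is kept only when both residues are drivers and their sets of affected parameters overlap. Reading "shared pharmacological effect" as a non-empty intersection is an interpretation made here, and the residue labels and contacts are hypothetical.

```python
def connected_drivers(driver_effects, active_contacts):
    """Return (connected driver residues, network edges).

    driver_effects: driver residue -> set of affected parameters,
        e.g. {"potency"}, {"efficacy"}, or both.
    active_contacts: iterable of (residue, residue) active state-specific contacts.
    """
    edges = []
    for a, b in active_contacts:
        both_drivers = a in driver_effects and b in driver_effects
        # Keep the contact only if the two drivers share at least one effect.
        if both_drivers and driver_effects[a] & driver_effects[b]:
            edges.append((a, b))
    connected = sorted({res for edge in edges for res in edge})
    return connected, edges


# Hypothetical toy data.
drivers = {
    "A": {"potency"},
    "B": {"potency", "efficacy"},
    "C": {"efficacy"},
    "D": {"potency"},
}
contacts = [("A", "B"), ("B", "C"), ("A", "C"), ("D", "X")]

network_residues, network_edges = connected_drivers(drivers, contacts)
unconnected = sorted(set(drivers) - set(network_residues))
```

In this toy case, a residue affecting both parameters ("B") links a potency-only residue to an efficacy-only one, mirroring the description of residues in the receptor center that connect the two groups.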


Key contact-rewiring events initiate and 
transmit information to mediate signaling 


To understand how information is transmitted 
in the adrenaline-B2AR-Gs signaling system, we 
analyzed the relation between the ligand- and G 
protein-binding residues and the allosteric 
network. Focusing on the ligand-receptor inter- 
face, the allosteric contact network included 
three residues directly located in the
ligand-binding site (N293⁶ˣ⁵⁵, F290⁶ˣ⁵², and S207⁵ˣ⁴⁶¹,
all affected potency only when mutated); the 
other eight ligand-binding residues do not 
form active state-specific contacts. This high- 
lights the existence of residues in the ligand- 
binding pocket that directly participate in the 
allosteric network. It also highlights the dis- 
tinct roles mediated by the ligand-contacting 
residues on the receptor: although several re- 
sidues stabilize ligand-receptor interactions, a 
subset of three driver residues that also con- 
tact the ligand likely initiate the allosteric 
changes during receptor activation (Fig. 5 and 
fig. S9). The latter set of three residues makes 
more extensive contact with other receptor 
residues compared with adrenaline. By con- 
trast, ligand-binding residues that do not 
mediate active state-specific contacts (e.g., 
modulator residues D113³ˣ³² and N312⁷ˣ³⁸)
make substantial contacts with adrenaline and 
fewer contacts with other receptor residues 
(fig. S9). Thus, some receptor and ligand atom 
contacts might contribute to the affinity of lig- 
and binding, whereas others might have a more 
direct role in triggering the allosteric network 
to transmit the signaling response, perhaps 
through initiating a structural change and 
stabilizing a particular receptor conformation. 

Focusing on the G protein-binding residues, 
of the nine that affected efficacy or potency 
when mutated (Materials and methods), one 
residue (driver residue T68²ˣ³⁹) is part of the
allosteric network. Although T68 does not 
mediate a contact in the receptor-heterotrimeric 
G protein complex structure, it does form a 
contact with Gs in the receptor-Gs peptide 
complex structure (37), which may represent 
an intermediate state in complex formation. 
Furthermore, L275⁶ˣ³⁷ (a disconnected driver
that affects efficacy when mutated) mediates
contact with a conserved residue on the G protein
[H5.25; CGN scheme (28)]. L275⁶ˣ³⁷ is held
in place through an inactive state-specific
contact with I127³ˣ⁴⁶. Upon receptor activation,
this inactive state-specific contact is broken,
allowing 6x37 to contact the G protein and
3x46 to mediate an active state-specific contact
with Y326⁷ˣ⁵³ (38). Notably, this noncovalent
contact is between a residue that affects
potency only (3x46) and one that affects ef- 
ficacy only (7x53). Thus, the G protein-binding 
residues that affect efficacy and potency when 
mutated can be classified into those that di- 
rectly interact with the G protein and those 
that are part of or are connected to the allo- 
steric network (fig. S10). Together, these obser- 
vations highlight the importance of key rewiring 
events in the allosteric network involving the 
switch from the inactive state-specific con- 
tacts to active state-specific contacts upon 
ligand binding and G protein engagement for 
signal transduction (Fig. 5D and fig. S10). It 
also highlights that as functionally relevant 
intermediate state conformations are structur- 
ally characterized, the roles of some of the resi- 
dues identified can be interpreted in more 
mechanistic detail. 


Modulator residues near the allosteric 
network and functional sites 
modulate pharmacology 


Of the 82 residues that were important for ef- 
ficacy or potency, the role of two-thirds (54 of 
82 residues; 41 are driver and 13 are modu- 
lators) could be explained as either involved in 
ligand or G protein binding, conserved motif 
residues, or driver residues forming the allo- 
steric network or that mediate active state- 
specific contacts (fig. S11). We then investigated 
the role of the remaining one-third of the re- 
sidues that are pharmacologically important 
(28 of 82 residues) and categorized them ac- 
cording to their likely function (fig. S11; these 
are the remaining 28 of the 41 modulator resi- 
dues). Of those, the function of 10 residues 
may be explained by their ability to directly 
contact and perhaps stabilize a residue in the 
ligand-binding pocket or by being in the puta- 
tive ligand entry pathway. Another six resi- 
dues mediate noncovalent contacts with a 
driver residue (network modifier), and the 
function of seven modulators may be explained 
by lipid interaction. The roles for the remain- 
ing five residues were less clear (table S1, ex- 
planation column). We analyzed conformations 
of the active and inactive states, but a deeper 
understanding of the role of modulator resi- 
dues can be obtained through the investiga- 
tion of intermediate conformational states, 
by considering the role of structured water 
molecules in stabilizing or mediating confor- 
mational transitions, or through the analyses 
of data describing receptor dynamics (39, 40). 
Taken together, these results identify and de- 
scribe how residues around the allosteric network 
and functional sites may modulate ligand efficacy
and potency in β2AR.


Surface-exposed driver, modulator, and 
passenger residues represent key 
allosteric sites 


Some of the pharmacologically important po- 
sitions mapped to surface-exposed sites that 
were not part of the orthosteric ligand-binding 
site or the G protein-binding interface (Fig. 
2C). The identification of pharmacologically 
important residues that lie on the receptor sur- 
face suggested possible structural sites for 
modulating receptor signaling that could be 
targeted by natural allosteric molecules (41, 42). 
To test whether lipids, such as cholesterol, ex- 
ert effects allosterically by binding to such 
sites and altering signaling, we analyzed struc- 
tures of receptors in complex with cholesterol 
(Materials and methods and fig. S12). Choles- 
terol bound to a site containing three modu- 
lator and three passenger residues on the 
receptor surface. Thus, surface-exposed phar- 
macologically important residues might be 
targeted by synthetic allosteric ligands. In line 


with this idea, an analysis of the structure of 
the receptor in complex with the negative al- 
losteric modulator (NAM) AS408 revealed that 
its binding interface contained two amino 
acids, I214⁵ˣ⁵³ (driver) and C125³ˣ⁴⁴ (modulator),
that are important for receptor signaling
and three passenger residues, all of which are 
surface exposed (Fig. 6, A and B, and Mate- 
rials and methods; 7 residues contact the 
NAM). Similarly, an analysis of the structure of 
the receptor in complex with the positive al- 
losteric modulator (PAM) 6FA showed that it 
bound to a different exposed surface containing
the driver residue D130³ˣ⁴⁹, which is important
for receptor signaling (Fig. 6, A to C).
Five of the nine residues in the PAM binding 
site were passengers—i.e., residues that medi- 
ate active state-specific contacts but that do 
not affect efficacy or potency when mutated 
(Fig. 6, A to C). Passenger residues were sig- 
nificantly overrepresented (compared with 
random expectation) in both PAM and NAM 
binding sites, even when the N and C termini 
were excluded from the test (P = 15 x 10°; 
hypergeometric test). 
Fig. 6. Allosteric sites, evolutionary conservation, and natural variation.
(A) Surface-exposed, pharmacologically important residues are shown in black 


These results show that allosteric modu- 
lators, which were discovered through screen- 
ing efforts, tend to bind surface-exposed sites 
that contain driver, modulator, and passenger 
residues rather than the more prevalent by- 
stander residues (i.e., residues that do not me- 
diate active state-specific contacts and are not 
important for pharmacology). Although the 
active state-specific contacts mediated by the 
passenger residues might minimize entropic 
cost by stabilizing a specific receptor confor- 
mation, thereby energetically favoring alloste- 
ric ligand binding, the perturbation of the 
driver or modulator residues at such sites 
might affect the allosteric network and hence 
mediate the positive or negative cooperativity 
at the orthosteric ligand-binding site. In other 
words, a combination of (i) conformational 
stabilization of the active or inactive state by 
the allosteric ligand through interactions with 
passenger residues and (ii) perturbation of 
the allosteric network through interactions 
with surface-exposed modulator or driver resi- 
dues might contribute to the property of the 
allosteric ligand. An important implication 
below the x axis refer to different parts of the modulator and are mapped on
the structure view for reference. (D) The number of residues with known SNPs
for each class. (E) Percentage of residues for which at least one SNP has been
reported by class. Significant differences from the expected frequency by chance
are indicated according to a hypergeometric test (*P < 0.05; **P < 0.01;
***P < 0.001; ****P < 0.0001; ns, not significant). (F) Adrenergic receptor ET score
for the different residue classes. Groups were compared using a Wilcoxon
test (*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001; ns, not significant).
Driver refers to the subset that is in the disconnected driver group.
of this observation is that the discovery of
such surface-exposed driver, modulator, and 
passenger residues through the integration 
of pharmacological and structural data (in the 
presence of orthosteric ligands and the ab- 
sence of allosteric compounds) may reveal sites 
that could be targeted for development of new 
allosteric ligands. 


Passenger, modulator, and driver residues are 
under higher evolutionary selection pressure 


To assess the importance of the residues rele- 
vant for pharmacology and/or active-state 
conformation, we analyzed data on natural 
genetic variation from the human population. 
An analysis of the genome sequence data from 
more than 140,000 individuals available through 
the gnomAD database (43) revealed a notable 
pattern in the distribution of single-nucleotide 
polymorphisms (SNPs) according to the differ- 
ent residue categories (Fig. 6, D and E). By- 
stander residues harbored a larger number 
of positions with SNPs compared with what 
is expected by chance. Passenger and mod- 
ulator residues contained fewer positions with 
SNPs than expected by chance. The allosteric 
network mediated by the connected driver re- 
sidues had the lowest number of positions 
harboring SNPs in the human population, 
which indicates that these residues were under 
the strongest selection pressure (Fig. 6, D and 
E, and Materials and methods). Thus, driver, 
modulator, and passenger residues appear to 
be under differential selection pressure com- 
pared with bystander residues of this receptor 
in the human population, which is consistent 
with the importance of the residue categories. 

To assess the importance of the residues 
across orthologs of adrenergic receptors from 
different species, we performed an analysis of 
evolutionary trace (ET) scores in the context of 
our residue classification (i.e., driver, modula- 
tor, passenger, and bystander). ET is a phylo- 
genetic method that identifies functionally 
important positions in protein families. Low 
ET scores indicate positions that are highly 
conserved across homologs. Depending on how 
the alignments are constructed, residues with 
low ET scores can also represent positions that 
are conserved in a subfamily-specific manner 
(44-46). Driver residues were the most con- 
served followed by modulator, passenger, and 
bystander residues (Fig. 6F). Not all highly 
conserved residues were pharmacologically 
or structurally important; hence, the role of a 
residue cannot be predicted from evolutionary 
conservation alone. These findings reveal that 
categorizing residues on the basis of structural 
and pharmacological importance can reveal 
category-specific associations of positions under 
purifying selection within the adrenergic re- 
ceptors, aminergic receptors, as well as the 
entire family of class A GPCRs (Fig. 6F and 
fig. S13). 
Mapping of the ET scores (adrenergic GPCRs)
onto the allosteric networks revealed that driver 
residues with the lowest ET scores were at the 
center of the network in the receptor core, 
including all of the network residues in TM3 
and TM7 (fig. S13). By contrast, the network 
positions with the lowest conservation (i.e., 
higher variability; ET scores > 3rd quartile, 
higher than 75% of the values) were either 
close to the ligand-binding site or in the in- 
tracellular region (fig. S13). These positions are 
more tolerant to mutations and may evolve or 
have evolved to recognize new ligands or dif- 
ferent G proteins (fig. S13). This suggests that 
although a major part of the residues consti- 
tuting the allosteric network is conserved 
within adrenergic GPCRs, key positions that 
interface with the ligand or G protein that are 
necessarily receptor-specific tend to be more 
variable. The uncoupling of ligand- and G 
protein-binding sites might have facilitated 
the evolution of GPCRs to respond to chem- 
ically diverse ligands and to couple to different 
G proteins. Collectively, the observations on 
human genetics and evolutionary data, which 
are independent of the approaches that we 
used to identify the residue classes, represent 
further evidence to support the model of the 
residues that govern the molecular origins of 
efficacy and potency in this signaling system. 


Discussion 


Ligand efficacy and potency at a receptor are 
the key pharmacological parameters used to 
describe ligand-mediated receptor signaling. 
Although it is widely established that agonist 
binding activates the receptor and mediates 
structural rearrangements, ultimately result- 
ing in a signaling response, we do not fully 
understand the molecular origins of this pro- 
cess and how each amino acid in the receptor 
decodes the information encoded in ligand 
atoms and collectively translates this into the 
emergence of efficacy and potency. We de- 
signed an integrative approach, combining 
pharmacological, structural, computational, 
and evolutionary analyses at a per-residue 
level, that revealed how receptors decode 
the information on the ligand atoms, mediate 
a series of structural changes through an al- 
losteric network, and ultimately influence G 
protein binding and the downstream signal- 
ing response. 

One-fifth of all residues in β2AR were important
for adrenaline efficacy and potency for
Gs signaling. Although some residues mapped 
to functionally important sites, such as ligand- 
binding, G protein-binding, and conserved 
motif residues, more than two-thirds of the 
pharmacologically important residues mapped 
to other sites on the receptor, including those 
on the receptor surface. A conceptual frame- 
work that combined structural and pharma- 
cological data helped us to propose potential 
roles for the receptor residues and allowed us
to distinguish key, pharmacologically impor- 
tant structural changes from those that are 
less pharmacologically relevant. This frame- 
work resulted in the identification of an al- 
losteric network of noncovalent contacts (based 
on active state-specific structural changes) 
that are mediated by pharmacologically im- 
portant residues from the extracellular to the 
intracellular side. Our classification of residues 
into drivers, modulators, passengers, and by- 
standers provides a rational framework to 
explain the molecular and structural origins of 
potency and efficacy. 

The exact nature of the pharmacologically 
and structurally important residues, as well as 
the allosteric network, likely depends on the 
choice of ligand, receptor, and effector. We 
expect considerable overlap for closely rela- 
ted systems based on the evolutionary analy- 
ses, but it is likely that the residues and the 
network will consist of nonoverlapping receptor 
positions that depend on the specific ligand 
and the effector, including the G protein subtype. 

In addition to revealing such networks, the 
detailed analysis of ligand atom contacts with 
receptor residues presented in this work can 
help us to better understand the role of in- 
dividual atoms in eliciting a pharmacological 
response. It can also aid in ligand design by 
enabling the identification of chemical groups 
that can be modified to achieve a desired sig- 
naling response. Our observation that surface- 
exposed driver, modulator, and passenger 
residues identified using the endogenous lig- 
and are contacted by NAMs and PAMs sug- 
gests that such sites should be prioritized in 
high-throughput virtual screening efforts for 
the discovery of potential allosteric modulators. 

Apart from revealing general principles, the 
approach and the findings presented here can 
help us understand how disrupting and engi- 
neering parts of the network could alter re- 
ceptor output, thereby facilitating the design 
of receptors with desired signaling properties 
(47). It could also help us understand how 
changes in receptor residues, such as SNPs or 
disease-related mutations, could influence sig- 
naling responses to endogenous agonists or 
GPCR drugs. The framework developed in this 
work should enable the investigation of such 
questions for diverse ligand-receptor-signaling 
systems by leveraging high-throughput ap- 
proaches, such as deep mutational scanning, 
to pharmacologically characterize mutations 
and the increasing availability of new struc- 
tures of ligand-receptor-signaling protein com- 
plexes. Furthermore, the framework to classify 
residues on the basis of pharmacological and 
structural relevance can be applied to any func- 
tional readout and property describing confor- 
mational change. In this manner, the approach 
can be extended to biological systems beyond 
GPCRs. 




Materials and methods 

Experimental methods 

DNA constructs and mutagenesis 

The design of the human β2AR construct was based on the coding
sequence of the commercially available pSNAPf-ADRB2 Control Plasmid
(NEB), which consists of an N-terminal signal sequence, the SNAP-tag,
and the receptor sequence. A BamHI restriction site was added between
the SNAP-tag and the receptor sequence, and the receptor sequence was
codon optimized. The wild-type receptor amino acid at position 16 was
arginine. The two most common alleles at position 16 are arginine and
glycine (as defined by the 1000 Genomes Project). SNAP-β2AR was cloned
into pcDNA3.1. BRET-based biosensor constructs for the Gαs, Gβ1, and
Gγ1 subunits were in pcDNA3.1. The Gs biosensor consisted of
RlucII-Gαs, in which Renilla luciferase (RlucII) was inserted after
amino acid 67, unmodified human Gβ1, and GFP10-Gγ1.

β2AR mutants for positions 2 to 412 were generated as described (48).
Amino acids other than alanine were replaced by alanine, and native
alanines were replaced with glycines. Briefly, primers were designed
using the custom-made software AAScan (49) (available at
https://github.com/dmitryveprintsev/AAScan) and ordered from
Integrated DNA Technologies (IDT) as 500 pmol DNA oligos in 96-well
plates. The primer sequences are available in table S83. Forward and
reverse mutagenesis primers were used in separate polymerase chain
reactions (PCRs), each together with one primer that annealed in the
ampicillin resistance gene (CTCTTACTGTCATGCCATCCGTAAGATGC and
GCATCTTACGGATGGCATGACAGTAAGAG). PCRs were run as touchdown PCRs in
20 µl final volume. Resulting half-vector fragments were combined,
digested with 0.5 µl DpnI (NEB) for 1 hour, cleaned up using ZR-96
DNA Clean & Concentrator-5 (Zymo Research), assembled by Gibson
assembly using HiFi DNA assembly Master Mix (NEB) at 45°C for 1 hour
(50), and transformed using Mix & Go competent cells (Zymo Research).
The resulting colonies were cultured in 5 ml LB medium in 24-well
plates overnight. The DNA was purified using Qiaprep 96 Plus kits
(Qiagen) and sent for sequencing (Bio Basic Inc. or in-house).
Sequences were analyzed using custom software MutantChecker (available
at https://github.com/dmitryveprintsev/AAScan). Mutants that were not
successfully generated as described were cloned using an alternative
method: pcDNA3.1 was digested with NheI and XhoI (NEB), and the β2AR
coding sequence was amplified in two PCRs, each using one of the
mutagenesis primers and either the T7long or the BGH primer
(CGAAATTAATACGACTCACTATAGGGAGACCCAAGCTGG and TAGAAGGCACAGTCGAGG),
which anneal upstream and downstream of the coding sequence,
respectively. The two fragments were combined, digested using DpnI,
and cleaned up as described above. The clean fragments were assembled
with digested, cleaned pcDNA3.1 using HiFi DNA assembly Master Mix
(NEB).
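
The substitution rule used for the scan (every non-alanine residue to alanine, every native alanine to glycine) can be illustrated with a short Python sketch. This is not the AAScan software; the function names and the six-residue toy sequence are hypothetical and serve only to show how mutant names follow from the rule.

```python
def scan_substitution(residue: str) -> str:
    """Replacement residue under the scan rule: A -> G, everything else -> A."""
    return "G" if residue == "A" else "A"


def scan_mutants(sequence: str, first: int, last: int) -> list[str]:
    """Mutant names (e.g. 'R4A') for 1-based positions first..last inclusive."""
    mutants = []
    for position in range(first, last + 1):
        wild_type = sequence[position - 1]
        mutants.append(f"{wild_type}{position}{scan_substitution(wild_type)}")
    return mutants


# Hypothetical toy sequence, NOT the receptor sequence.
toy_sequence = "MGARAL"
print(scan_mutants(toy_sequence, 2, 6))
```

Starting the range at position 2 mirrors the text, in which the scan covers positions 2 to 412 and leaves the initiator methionine untouched.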


Cell-culture and BRET-based signaling assays 


Adherent HEK-293 SL cells (a gift from S. Laporte) were cultured in
Dulbecco’s minimum essential medium (DMEM) with 4.5 g/l glucose and
L-glutamine, supplemented with 10% newborn calf serum (NCS) (Wisent
BioProducts, Canada) and 1x penicillin-streptomycin (PS) (100x, Wisent
BioProducts, Canada), at 37°C with 5% CO₂. For transfections, DMEM
without phenol red was used. Cells were transiently transfected using
linear 25 kDa polyethyleneimine (PEI) (Polysciences Inc., Canada,
no. 23966) in a 3:1 ratio with DNA. Per condition, a total of 1 µg DNA
was used to transfect 240,000 cells in 1.2 ml in suspension. The DNA
mix consisted of 100 ng receptor DNA combined with 75 ng Gαs-RlucII,
200 ng GFP10-Gγ1, and 100 ng Gβ1 plasmid DNAs; finally, 525 ng salmon
sperm DNA (ssDNA) was added to bring the total amount of DNA to
1000 ng in a volume of 100 µl. DNA was then combined with diluted PEI
in phosphate-buffered saline (PBS) and added to the cells. Cells were
seeded at 20,000 cells per well into Cellstar PS 96-well cell culture
plates (Greiner Bio-One, Germany) and incubated for 2 days at 37°C,
5% CO₂. A Fluent780 liquid handler equipped with an MCA 96-well head
(Tecan) and a Multidrop Combi (Thermo), each installed in a class II
biosafety cabinet, were used to aid with transfections.
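
The arithmetic behind the DNA mix can be checked with a few lines of Python. The plasmid amounts are those given above; the helper names are hypothetical, and reading the 3:1 PEI:DNA ratio as a mass ratio is an assumption, because the text does not state the basis of the ratio.

```python
# Plasmid amounts (ng) per condition, as listed in the text.
PLASMIDS_NG = {
    "receptor": 100,
    "Galpha_s-RlucII": 75,
    "GFP10-Ggamma1": 200,
    "Gbeta1": 100,
}
TOTAL_DNA_NG = 1000  # 1 µg DNA per condition


def filler_dna_ng(plasmids_ng, total_ng=TOTAL_DNA_NG):
    """Amount of salmon sperm DNA needed to reach the total DNA amount."""
    filler = total_ng - sum(plasmids_ng.values())
    if filler < 0:
        raise ValueError("plasmid amounts exceed the total DNA amount")
    return filler


def pei_ng(total_dna_ng, ratio=3.0):
    """PEI amount, ASSUMING the 3:1 PEI:DNA ratio is by mass."""
    return ratio * total_dna_ng


print(filler_dna_ng(PLASMIDS_NG))  # filler DNA in ng
print(240_000 // 20_000)           # wells per condition if all cells are plated
```

The second printed value follows only from the stated cell numbers (240,000 cells transfected, 20,000 cells per well); whether every transfected cell is plated is not stated in the text.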

Before measurement, the medium was removed from the 96-well plates and
replaced with Tyrode’s buffer (137 mM NaCl, 0.9 mM KCl, 1 mM MgCl₂,
11.9 mM NaHCO₃, 3.6 mM NaH₂PO₄, 25 mM Hepes, 5.5 mM glucose, 1 mM
CaCl₂, pH 7.4), and the plates were incubated for at least 30 min at
37°C. Plates were treated with ligand or vehicle (buffer control) for
10 min before measurement and with coelenterazine 400a (Nanolight
Technology) at 5 µM final concentration 5 min before measurement.
Ligand concentrations ranged from 31.6 nM (10^-7.5 M) to 3.16 mM
(10^-2.5 M) in half-log steps, diluted in vehicle, plus vehicle
itself. Coelenterazine 400a was prepared with 1% Pluronic F-127 in
Tyrode’s buffer to increase solubility. Adrenaline was prepared in
0.01 M HCl to increase stability; the low pH was buffered by Tyrode’s
buffer upon addition of ligand to the cells. The stability of
adrenaline in the above aqueous solution was confirmed by comparing
the signaling responses obtained immediately after ligand preparation
and after 3 hours, 6 hours 20 min, 8 hours 40 min, and 11 hours. The
BRET signal was read using a Synergy Neo (Biotek) equipped with dual
photomultiplier tubes (PMTs) (emission: 410 nm and 515 nm, gain 150
for each PMT, and 1.2 s integration time). BRET was measured on an
integrated platform with a Biomek NX liquid handler equipped with a
96-well head (Beckman, used for ligand addition), a Multidrop Combi
(Thermo, luciferase substrate addition), an automated CO₂ incubator
Cytomat 6001 (Thermo, plate storage before experiment and during
incubation), a barcode reader, and a Synergy Neo plate reader
(Biotek). Ligand dilutions were kept at 4°C with the aid of a Peltier
element. All signaling experiments were run in three biological
replicates by default. In those cases where very low luciferase counts
were observed for multiple plates, indicative of a failed
transfection, all transfections and measurements of the day were
repeated. All transfections and measurements for the days on which an
instrument failure led to a premature stop of the robotic system were
also repeated. BRET data for transfections with very low luciferase
counts were completely removed during the data analysis (see manual
curation).
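
The ligand dilution series described above can be sketched as follows. The function name is hypothetical; the exponents -7.5 and -2.5 are taken from the stated end points (31.6 nM and 3.16 mM), and treating the vehicle as one additional point per curve is how the text describes it.

```python
def half_log_series(start_exp=-7.5, stop_exp=-2.5, step=0.5):
    """Return (log10 molar, molar) pairs in half-log steps, end points included."""
    n_points = round((stop_exp - start_exp) / step) + 1
    exponents = [start_exp + i * step for i in range(n_points)]
    return [(exponent, 10.0 ** exponent) for exponent in exponents]


series = half_log_series()
for exponent, molar in series:
    print(f"10^{exponent:.1f} M = {molar:.3g} M")

# Ligand concentrations plus one vehicle-only point per curve.
print(len(series) + 1)
```

Eleven ligand concentrations plus the vehicle point give the 12 data points per concentration-response curve mentioned in the data-processing section.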

All signaling experiments were carried out in at least three
biological replicates; wild-type and mock-transfection controls were
repeated six times per measurement day.


Cell-surface ELISA 


HEK-293 SL cells were transfected using PEI as above, seeded into
poly-L-lysine-coated 96-well plates (Greiner Bio-One, Germany), and
incubated for 2 days at 37°C, 5% CO₂. Each well was washed with 200 µl
PBS, and the cells were then treated with 50 µl 3% paraformaldehyde
per well for 10 min. Each well was then washed three times with wash
buffer (PBS + 0.5% BSA); for the last wash step, the cells were
incubated with wash buffer for 10 min. Primary rabbit anti-SNAP
antibody (GenScript, USA) was added at 0.25 µg/ml in 50 µl and
incubated for 1 hour at room temperature (RT), followed by three wash
steps as described above. The cells were incubated with secondary
anti-rabbit HRP antibody (GE Healthcare) diluted 1:1000 for 1 hour.
Again, the cells were washed three times with wash buffer and then
three times with PBS. Per well, 100 µl SigmaFast solution was added,
and the plates were incubated at RT in the dark. Reactions were
stopped by the addition of 25 µl 3 M HCl. Then 100 µl of the resulting
solution was transferred to a new transparent 96-well plate, and the
optical density was read at 492 nm using a Tecan GENios Plus
microplate reader. Cell-surface ELISAs were carried out in three
biological replicates with internal quadruplicates. Each plate
contained wild-type and mock-transfected cells as controls.


Analysis of biosensor signal dependence 
on cell-surface expression 


Twelve different amounts of wild-type recep- 
tor DNA were combined with constant amounts 
of biosensor DNA and transfected into HEK293
cells as described above. The amounts of wild- 
type receptor DNA were 0.05, 0.1, 0.2, 0.5, 1, 2, 5, 
10, 20, 50, 150 and 200 ng. Two days after 
transfection, the cell-surface expression level 
and Gs biosensor BRET signal were deter- 
mined for each transfection. Cell-surface ex- 
pression in percent of the level obtained for a 
100 ng wild-type receptor DNA transfection 
(the standard used for signaling experiments 
as described above) was then plotted against 
Gs biosensor signal amplitude and Gs biosensor
logEC50 (fig. S3, A and B). All experiments
were repeated in three to four biological 
replicates. 


Computational methods and data analysis

Processing of signaling data


The BRET data were processed using BRET2DTF 
and DataFitter custom software (Dmitry Veprintsev; 
https://github.com/dbv123w/DataFitter). All 
biological replicates of concentration-response 
curves were separately fitted to a Hill equation 
using a Hill slope of 1 to determine the agonist 
concentration at the half-maximal signal response
(logEC50) as well as the pretransition
and posttransition baselines (may also be referred
to as top and bottom). All concentration-
response curves were visually inspected. We 
removed complete repeats (i.e., all 12 data points) 
in those cases where the experiment had ob- 
viously failed due to a transfection that did 
not work (see explanation in “Cell-culture and 
BRET-based signaling assays” section above) 
or where data could not be fitted reliably due 
to noise. In the whole mutant dataset of 1324 
concentration-response curves (number ex- 
cludes controls), 42 single replicate curves 
were excluded. In addition, replicates for seven 
mutations showed no measurable signaling 
(see table S1, curation column set to 2). Data 
could be fitted starting at ~20% of wild-type 
amplitude, corresponding to ~0.015 difference 
between the pre- and posttransition baselines 
in raw BRET (where raw BRET = GFP/RlucII 
signal). In total, after these curation steps, 
quadruplicates for 77 mutants, triplicates for 
306 mutants, and duplicates for 29 mutants 
were retained. No single data points were re- 
moved in any case. 
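
For readers unfamiliar with this kind of fit, the following Python sketch shows a Hill equation with the slope fixed to 1 and one simple way to estimate logEC50 and the two baselines. It is not the BRET2DTF or DataFitter code: the grid scan over logEC50 with a closed-form linear solution for the baselines is a stand-in technique chosen to keep the example dependency-free, the vehicle point is left out, and the synthetic values are hypothetical.

```python
def hill(log_conc, log_ec50, bottom, top):
    """Hill equation with slope 1; log_conc and log_ec50 in log10 molar."""
    return bottom + (top - bottom) / (1.0 + 10.0 ** (log_ec50 - log_conc))


def fit_hill(log_concs, responses, grid_step=0.01):
    """Scan logEC50 on a grid; for each value solve the baselines by least squares."""
    low, high = min(log_concs), max(log_concs)
    n_grid = int(round((high - low) / grid_step)) + 1
    n = len(responses)
    mean_y = sum(responses) / n
    best = None
    for k in range(n_grid):
        log_ec50 = low + k * grid_step
        # Fractional response at each concentration for this logEC50.
        frac = [1.0 / (1.0 + 10.0 ** (log_ec50 - x)) for x in log_concs]
        mean_f = sum(frac) / n
        var_f = sum((f - mean_f) ** 2 for f in frac)
        if var_f == 0.0:
            continue
        # The model is linear in (bottom, amplitude) once logEC50 is fixed.
        amplitude = sum((f - mean_f) * (y - mean_y)
                        for f, y in zip(frac, responses)) / var_f
        bottom = mean_y - amplitude * mean_f
        sse = sum((bottom + amplitude * f - y) ** 2
                  for f, y in zip(frac, responses))
        if best is None or sse < best[0]:
            best = (sse, log_ec50, bottom, bottom + amplitude)
    _, log_ec50, bottom, top = best
    return log_ec50, bottom, top


# Hypothetical noise-free curve on the half-log concentration grid.
log_concs = [-7.5 + 0.5 * i for i in range(11)]
responses = [hill(x, -5.5, 0.100, 0.175) for x in log_concs]
print(fit_hill(log_concs, responses))
```

Fixing the slope leaves three free parameters per curve, which is why 11 ligand concentrations are enough to fit each replicate separately.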

The results from data fitting were read into 
R (RStudio 1.3.959 to 2022.07.1 and R 4.0.0 to 
4.2.1) and processed using custom scripts. The 
following packages were used: tidyverse (espe- 
cially dplyr, ggplot2, purrr, tibble, tidyr, forcats, 
and stringr), plotly, MASS, reshape, reshape2, 
ggrepel, patchwork, ggpubr, bio3d (51), and
openxlsx. 

The data were read in, reformatted, averaged, annotated (addition of
GPCRdb number, expression level, amino acid number, mutation),
filtered based on the results of manual curation, normalized, and
visualized in R. We checked the dataset for outliers, day-to-day
variation and trends, the effect of expression level on signaling
response, and baseline signal variation. We compared the BRET signal
for mock transfections, in which ssDNA was used instead of receptor
DNA, with mutant pretransition baselines to assess possible changes in
constitutive activity. Based on the variability of wild-type
pretransition baselines, we decided not to interpret changes in mutant
pretransition baselines. Although we were not able to measure changes
in constitutive activity reliably, this does not mean that none of the
mutants affect constitutive activity. Measured mutant amplitudes were
normalized to the wild type using the most recently measured wild-type
dataset to correct for day-to-day variation and variation during the
day, whereas mutant logEC50 values were normalized to the mean logEC50
obtained for the wild type. The normalization did not correct for
expression level because the dependence of the signal on expression
was minor for receptors expressing to at least 25% of wild-type level
(fig. S3, A and B). In addition, the typical error of an ELISA was
much larger than the error of a BRET experiment, at least in our
experience; such a correction would therefore have lowered the
confidence in the results. The measured logEC50 and amplitude values
were also used to interpret 22 receptor positions in Hauser et al.
(15) and Hedderich et al. (42). Cutoffs were applied to the normalized
and curated data, resulting in discretization. Once normalized, the
wild-type data were centered around a normalized amplitude of 1 and a
normalized logEC50 of 0. Cutoffs were chosen so that none of the 57
wild-type concentration-response curves (measured on different days
and batches) were classified as not wild-type-like. For the normalized
amplitude, the cutoff was <0.74 (this corresponds to two wild-type
standard deviations). For the normalized logEC50, the cutoff was
>0.87, which corresponds to a 7.4-fold change.

Our cutoff for a gain of function would be
>1.28 for the normalized amplitude and <-0.71
for the normalized logEC50 (cutoffs were applied
to nonrounded values; table S1 values
are displayed to two decimal precision). Be- 
cause we were interested in identifying those 
positions whose mutation impairs signaling, 
the one mutation that increased efficacy was 
not explicitly considered in our analyses. 
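
The cutoffs translate directly into a small classification step. The sketch below is illustrative rather than the authors' R code: the function and key names are hypothetical, and it assumes the normalized logEC50 is expressed as the difference from the wild-type mean, which is consistent with the wild-type data being centered at 0.

```python
AMPLITUDE_LOSS_CUTOFF = 0.74   # normalized amplitude below this: reduced efficacy
LOG_EC50_LOSS_CUTOFF = 0.87    # normalized logEC50 above this: reduced potency
AMPLITUDE_GAIN_CUTOFF = 1.28   # normalized amplitude above this: increased efficacy
LOG_EC50_GAIN_CUTOFF = -0.71   # normalized logEC50 below this: increased potency


def fold_change(delta_log_ec50):
    """Fold change in EC50 implied by a shift in logEC50."""
    return 10.0 ** abs(delta_log_ec50)


def classify_effect(norm_amplitude, norm_log_ec50):
    """Flag each possible effect of a mutation using the stated cutoffs."""
    return {
        "reduced_efficacy": norm_amplitude < AMPLITUDE_LOSS_CUTOFF,
        "reduced_potency": norm_log_ec50 > LOG_EC50_LOSS_CUTOFF,
        "increased_efficacy": norm_amplitude > AMPLITUDE_GAIN_CUTOFF,
        "increased_potency": norm_log_ec50 < LOG_EC50_GAIN_CUTOFF,
    }


print(round(fold_change(LOG_EC50_LOSS_CUTOFF), 1))  # fold change at the potency cutoff
print(classify_effect(1.0, 0.0))                    # wild-type-like values
print(classify_effect(0.50, 1.20))                  # hypothetical impaired mutant
```

A shift of 0.87 log units is a factor of 10^0.87 in EC50, which is where the 7.4-fold figure comes from.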

Graphs and illustrations were generated in R (ggplot2), Adobe
Illustrator CS6 and CC, and Affinity Designer 2. For display in
figures, concentration-response curves were fitted in R using the
package drc. The raw BRET values were fitted to obtain the BRET value
for the pretransition baseline. Subsequently, the data were normalized
using the pretransition baseline and the wild-type signal amplitude,
resulting in a normalized BRET response in which 0% corresponds to the
pretransition baseline and 100% to the wild-type signal amplitude. For
fitting of the normalized data, the LL.4 function was used, the slope
was fixed to 1, and the pretransition baseline was fixed to 0.


Residue-residue contacts 


Residue-residue contacts were obtained from the Protein Contacts Atlas
(http://pca.mbgroup.bio/index.html) (52). Two residues are listed as a
contact if the distance between two of their atoms minus the atoms’
van der Waals radii equals 0.5 Å or less, corresponding to a maximum
distance of ~4.2 Å. Text files downloaded from Protein Contacts Atlas
containing residue-residue contacts were read into R and cleaned. We
then annotated the tables with additional information [secondary
structure elements (SSEs) and GPCRdb numbers for each amino acid] and
filtered the contact tables to exclude main chain-main chain contacts
and to include contacts between two different SSEs only. The contacts
were filtered to include only those residues resolved in both crystal
structures used for the comparison, 2RH1 and 3SN6. In the inactive and
in the active G protein-bound structures of the β2AR, 282 and 285
residues form 1275 and 1233 residue-residue contacts, respectively. To
focus on residue contacts that contribute to large domain motions
instead of local rearrangements during activation, we excluded
contacts formed exclusively between backbone atoms and contacts within
the same secondary structure element (i.e., contacts formed within a
single helix). This reduced the number of residue-residue contacts to
117 contacts that are only present in the inactive state; 61 contacts
that are only present in the active, G protein-bound state; and 191
contacts that are present in both the active and inactive states.
Contacts are listed in table S2.
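
The contact criterion and the split into state-specific and shared contacts amount to a distance test followed by set operations. The sketch below is a simplified illustration, not the Protein Contacts Atlas implementation; the function names, the residue labels, and the radii in the example are hypothetical.

```python
def is_contact(distance, radius_a, radius_b, tolerance=0.5):
    """Atoms are in contact if distance minus both vdW radii is <= tolerance (in Å)."""
    return distance - radius_a - radius_b <= tolerance


def split_contacts(inactive_pairs, active_pairs):
    """Partition residue-residue contacts by the state(s) in which they occur."""
    # frozenset makes each pair order-independent: (a, b) equals (b, a).
    inactive = {frozenset(pair) for pair in inactive_pairs}
    active = {frozenset(pair) for pair in active_pairs}
    return {
        "inactive_only": inactive - active,
        "active_only": active - inactive,
        "shared": inactive & active,
    }


# Hypothetical residue labels, for illustration only.
inactive_pairs = [("res1", "res2"), ("res2", "res3")]
active_pairs = [("res3", "res2"), ("res3", "res4")]
groups = split_contacts(inactive_pairs, active_pairs)
print({name: len(pairs) for name, pairs in groups.items()})
print(is_contact(3.8, 1.7, 1.7), is_contact(4.6, 1.7, 1.7))
```

Only the "active_only" group feeds into the residue classification, because drivers and passengers are defined by active state-specific contacts.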


Definition of the ligand- and G protein-binding sites

Definitions of the ligand-binding site and the G protein-binding site
were based on the structures of β2AR with adrenaline and an engineered
nanobody (PDB ID: 4LDO) (25) and in complex with heterotrimeric G
protein and Gs peptide (3SN6 and 6E67) (14, 37). All residues within
4 Å of adrenaline were classified as part of the ligand-binding site;
the residues were verified using LigPlot+ v.2.2 and literature data
(26, 53-56). Using PyMOL 2.5.2, all residues within 4 Å of G protein
or Gs peptide were determined and classified as residues in the G
protein-binding site. Additionally, this residue selection was
confirmed by atom-atom contacts from the Protein Contacts Atlas.
Residues in the ligand-binding and G protein-binding sites are marked
in table S1.


Residue classification 


We defined driver and modulator residues as 
those residues that affect Gs signaling upon 
mutation (see above for cutoffs). Additionally,
disconnected drivers participate in an active 
state-specific contact, and connected driver 
residues form an active state-specific contact 
to another driver residue. In addition, both 
connected driver residues need to negatively 
affect potency, or both need to affect efficacy. 
That is, if the mutation at one driver residue 
affected potency and the mutation at the in- 
teracting driver residue affected efficacy they 
were classified as disconnected drivers due 
to a mismatch in effects. Passenger and by- 
stander residues were defined as those resi- 
dues that do not affect Gs signaling upon 
mutation. Additionally, passenger residues 
must form at least one active state-specific 
contact. Residues not forming such contacts 
and not affecting Gs signaling upon mutation 
were classified as bystanders. Hierarchically, 
we defined (i) connected drivers as the struc- 
turally and pharmacologically most impor- 
tant residues, followed by (ii) disconnected 
drivers and (iii) modulators that are pharma- 
cologically important but do not mediate active 
state-specific contact, (iv) passenger residues 
which mediate active state-specific contact(s) 
but do not affect pharmacology, and finally 
(v) bystanders, which are not involved in form- 
ing an active state-specific contact and do 
not affect pharmacology. The allosteric net- 
work consisted of 15 residue-residue contacts 
(33 atom-atom contacts) between 23 residues. 
Of the 33 atom-atom contacts, 26 were side 
chain-side chain contacts and 7 were main 
chain-side chain contacts. 
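
The classification rules above can be written as a small decision procedure. The following Python sketch is one reading of those rules, not the authors' scripts: the `Residue` type, its field names, and the toy residues are hypothetical, and contacts are assumed to be recorded symmetrically (if a lists b as a partner, b lists a).

```python
from dataclasses import dataclass, field


@dataclass
class Residue:
    name: str
    affects_efficacy: bool = False
    affects_potency: bool = False
    # Names of partners in active state-specific contacts.
    active_contacts: set = field(default_factory=set)


def classify(residues):
    """Assign each residue to one of the five categories described in the text."""
    by_name = {residue.name: residue for residue in residues}
    classes = {}
    for residue in residues:
        important = residue.affects_efficacy or residue.affects_potency
        if not important:
            classes[residue.name] = (
                "passenger" if residue.active_contacts else "bystander")
        elif not residue.active_contacts:
            classes[residue.name] = "modulator"
        else:
            # Connected: a contact partner is impaired in the same parameter.
            connected = any(
                partner in by_name and (
                    (residue.affects_efficacy and by_name[partner].affects_efficacy)
                    or (residue.affects_potency and by_name[partner].affects_potency))
                for partner in residue.active_contacts)
            classes[residue.name] = (
                "connected driver" if connected else "disconnected driver")
    return classes


toy = [
    Residue("a", affects_potency=True, active_contacts={"b", "c"}),
    Residue("b", affects_potency=True, active_contacts={"a"}),
    Residue("c", affects_efficacy=True, active_contacts={"a"}),
    Residue("d", affects_efficacy=True),
    Residue("e", active_contacts={"f"}),
    Residue("f", active_contacts={"e"}),
    Residue("g"),
]
print(classify(toy))
```

In the toy data, residue c contacts a driver but is classified as a disconnected driver because c affects efficacy whereas a affects potency, which is the mismatch case described in the text.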


ET analysis 


We used ET analysis to identify β2AR residues
with evolutionarily encoded functional roles 
at the cross section of the network of pharma- 
cologically important residue positions. ET 
scores were obtained from the Lichtarge Lab 
ET webserver (http://evolution.lichtargelab.org/gpcr).
We compared the conservation of
amino acids at the level of the adrenergic re- 
ceptor subfamily [“Adrenoceptor (118)”], amine 
receptors [“All_amine (547)”], and class A 
(“ClassA_5105”) GPCRs. For each phylogenetic 
comparison, the lower ET scores at a residue 
position represent high conservation of spe- 
cific amino acids at the position within the 
phylogenetic class. We compared the enrich- 
ment of low ET score positions among the 
residues found in the driver, modulator, pas- 
senger, and bystander categories. Groups were 
compared using a Wilcoxon test (*P < 0.05; 
**P < 0.01; ***P < 0.001; ****P < 0.0001; ns, 
not significant). 


Extraction of structure features 


To calculate the distance between Cα atoms of each residue in the
inactive versus the active state, we used the highest-resolution
inactive-state structure (PDB ID: 2RH1) and the only available G
protein complex structure at the time of performing this analysis
(PDB ID: 3SN6) of the β2AR, aligned the two structures in PyMOL and
exported the new coordinates. The PDB files were read into R using the
package bio3d, followed by extraction of the x, y, and z coordinates
for each structure and the calculation of distance as

$$\text{distance} = \sqrt{(x_{\text{active}} - x_{\text{inactive}})^2 + (y_{\text{active}} - y_{\text{inactive}})^2 + (z_{\text{active}} - z_{\text{inactive}})^2}$$


Likewise, we extracted the dihedral angles for each residue in the
above structures using bio3d. The difference in the angles φ, ψ, and
ω between the active and inactive state was calculated as

$$\Delta\varphi = \left[(\varphi_{\text{active}} - \varphi_{\text{inactive}} + 180) \bmod 360\right] - 180$$
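
Both structural features are simple to compute once aligned coordinates and angles are available. The Python sketch below mirrors the two formulas; the function names and example values are hypothetical, and the original analysis was done in R with bio3d rather than with this code.

```python
import math


def ca_displacement(active_xyz, inactive_xyz):
    """Euclidean distance (Å) between a residue's Cα positions in the two states."""
    return math.dist(active_xyz, inactive_xyz)


def angle_difference(active_deg, inactive_deg):
    """Signed dihedral difference in degrees, wrapped into [-180, 180)."""
    # Python's % returns a non-negative result for a positive divisor,
    # which is the behavior the wrapping formula relies on.
    return (active_deg - inactive_deg + 180.0) % 360.0 - 180.0


print(ca_displacement((1.0, 2.0, 3.0), (4.0, 6.0, 3.0)))
print(angle_difference(170.0, -170.0))
print(angle_difference(-170.0, 170.0))
```

The wrapping step matters because dihedral angles are periodic: a change from -170° to 170° is a rotation of 20°, not 340°, and a plain subtraction would report the larger value.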


Calculation of accessible surface area 


Accessible surface area was calculated using dssp
(https://swift.cmbi.umcn.nl/gv/dssp/) (57, 58), and values are
provided in table S1. In addition, we calculated the accessible
surface area of the adrenaline-binding site with and without the
ligand present using MDTraj (https://www.mdtraj.org) and GROMACS
(https://www.gromacs.org/).


Analysis of SNP data 


SNP data for ADRB2 was retrieved from the 
Genome Aggregation Database (gnomAD, 
https://gnomad.broadinstitute.org/, version 
gnomAD v2.1.1). The data were analyzed using 
custom scripts with functions from the tidyverse 
(especially readr, dplyr, and ggplot2), and the 
packages janitor, rebus, and stats. Statistical 
over- and underrepresentation of SNPs in 
driver, modulator, passenger, and bystander 
positions were calculated using a hypergeo- 
metric test. 
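
A hypergeometric test of this kind can be written with the standard library alone. The sketch below is an illustration, not the authors' R code: the function names are hypothetical, the numbers in the example are made up, and the mapping of arguments (all receptor positions as the population, positions in one residue class as successes, positions carrying a SNP as draws) is an assumption about how the test was set up.

```python
from math import comb


def hypergeom_pmf(k, population, successes, draws):
    """P(X = k) when drawing `draws` items without replacement."""
    return (comb(successes, k) * comb(population - successes, draws - k)
            / comb(population, draws))


def overrepresentation_p(k, population, successes, draws):
    """P(X >= k): probability of seeing at least k class members among the draws."""
    upper = min(successes, draws)
    return sum(hypergeom_pmf(i, population, successes, draws)
               for i in range(k, upper + 1))


def underrepresentation_p(k, population, successes, draws):
    """P(X <= k): probability of seeing at most k class members among the draws."""
    lower = max(0, draws - (population - successes))
    upper = min(k, successes, draws)
    return sum(hypergeom_pmf(i, population, successes, draws)
               for i in range(lower, upper + 1))


# Made-up numbers: 10 positions, 4 in the class, 3 carry a SNP, 2 of those in the class.
print(overrepresentation_p(2, 10, 4, 3))
print(underrepresentation_p(2, 10, 4, 3))
```

Reporting both tails separately matches the text, which tests for over- and underrepresentation of SNPs in each residue class.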
TEXTILES


Biomimetic, knittable aerogel fiber for thermal insulation textile


Mingrui Wu, Ziyu Shao, Nifang Zhao, Rongzhen Zhang, Guodong Yuan, Lulu Tian, Zibei Zhang,
Weiwei Gao*, Hao Bai*


Aerogels have been considered an ideal material for thermal insulation. Unfortunately, their
application in textiles is greatly limited by their fragility and poor processability. We overcame these
issues by encapsulating the aerogel fiber with a stretchable layer, mimicking the core-shell structure of
polar bear hair. Despite its high internal porosity of over 90%, our fiber is stretchable up to 1000%
strain, which is greatly improved compared with that of traditional aerogel fibers (~2% strain). In
addition to its washability and dyeability, our fiber is mechanically robust, retaining its stable thermal
insulation property after 10,000 stretching cycles (100% strain). A sweater knitted with our fiber
was only one-fifth as thick as down, with similar performance. Our strategy for this fiber provides rich
possibilities for developing multifunctional aerogel fibers and textiles.


Aerogels are recognized as the best thermal insulation material for
their high porosity and extremely low thermal conductivity (1-4).
Since their invention in the 1930s, aerogels have been applied in a
wide spectrum of engineering fields, such as green buildings (5),
energy storage devices (6), catalyst carriers (7), environmental
treatment (8), and others. Unfortunately, the application of aerogels
in thermal insulation textiles has been greatly limited because of
their fragility and poor processability, which usually yields
irreversible structural collapse and, therefore, loss of thermal
insulation performance during the practical application of the
textile. Although there have been a few attempts to mix aerogels into
woven (9) or nonwoven (10) fabrics, the application of aerogels in
thermal insulation textiles is still severely limited owing to either
poor processability at high doping content or inefficient thermal
insulation at low content (11, 12). Aside from doping or coating
aerogels onto textiles, efforts were also made to directly develop
aerogel fibers with excellent thermal insulation properties (13). For
example, over the last decade, many aerogel fibers made with silica
(14), graphene (15, 16), carbon nanotube (17), titanium oxide (18),
and cellulose (19, 20) have been prepared through various methods,
including wet spinning (21), reaction spinning (22), and sol-gel
confined transition (23). Although some of these recently developed
aerogel fibers have achieved better mechanical properties (24, 25),
they still have insufficient strength and flexibility for weaving or
knitting into textiles, especially at a large area (26, 27). Moreover,
these aerogel fibers are not machine washable and easily lose their
thermal insulation capability underwater or in a humid environment.
Therefore, designing and fabricating knittable and weavable aerogel
fibers with excellent thermal insulation performance and
multifunctionality remains a challenge.

Many animals have evolved special furs to survive in extremely cold
environments (28-30). As a prime example, polar bear hair exhibits a
distinct core-shell structure (31-33). The hollow core with high
porosity effectively traps air and provides excellent thermal
insulation performance (31), and the dense shell
provides efficient mechanical protection (about 
300 MPa and 35% in tensile strength and 
strain, respectively) (34, 35). As a result, peo- 
ple have long been fascinated by the capabil- 
ity of polar bear hair to insulate heat in the 
Arctic. Mimicking the strategies of the polar 
bear hair represents a feasible way to develop 
robust thermal insulation materials, especially 
fibers and textiles (36-39). 

We encapsulated aerogel fibers with a thin, 
stretchable layer without severely sacrificing 
the thermal insulation property. The encapsu- 
lated aerogel fiber (EAF) is knittable and simul- 
taneously achieves both thermal insulation and 
mechanical robustness by mimicking the core- 
shell structure of the polar bear hair. We con- 
tinuously fabricated our EAFs using a facile 
two-step method in which aligned porous aero- 
gel fibers are obtained through freeze-spinning 
(36, 40-42), followed by a coating-drying step. 
We chose thermoplastic polyurethane (TPU) 
for the encapsulation layer because of its high 
mechanical strength and stretchability (43). 


Our EAF has a high internal porosity of over 
90% (measured by mercury porosimetry) while
exhibiting a large tensile strength of around 
7.3 to 12.7 MPa (capable of lifting a 500-g weight 
with a single fiber). In addition, the EAF shows 
super stretchability of around 1000 to 1600%
strain, nearly three orders of magnitude higher
than that of traditional aerogel fibers
(usually around 2% strain). The mechanical
properties make our fibers suitable for knitting 
and weaving. A sweater knitted with our fibers 
is only around one-fifth as thick as down for the 
same warmth and has excellent washability 
and dyeability. With its simplicity and scal- 
ability, our work provides a feasible strategy 
to fabricate robust, super stretchable, and 
multifunctional aerogel fibers, which are fun- 
damental for the development of thermal 
insulation textiles toward applications in ex- 
treme environments. 


Biomimetic principle and fabrication of EAF 


Polar bear hairs have a porous core and dense 
shell structure to prevent heat loss (Fig. 1A) 
(32, 44). These structural features decouple 
the thermal insulation function and me- 
chanical strength of the hair, which provided 
the inspiration for our design and prepara- 
tion of the synthetic fibers (45). With plenty 
of pores tens of micrometers in size, the polar 
bear hair captures a large volume of stationary 
air, which effectively suppresses both thermal 
conduction and convection. The aligned porous 
walls substantially reduce the number of ther- 
mal conductive paths and provide a multireflec- 
tive effect for strong infrared (IR) reflectance, 
which are both beneficial for efficient thermal 
insulation (7, 36). The dense shell protects the 
porous cores, making the hair waterproof and 
insensitive to humid air, which are the main rea- 
sons for the loss of thermal insulation prop- 
erty in typical aerogel materials. 

We designed an encapsulated aerogel fiber 
with similar thermal insulation mechanisms 
(Fig. 1B) and developed a facile two-step fabri- 
cation route to mimic the core-shell structure 
(Fig. 1C). First, we obtained aerogel fibers by 
freeze-spinning (36). We performed the freeze- 
spinning by using a well-dispersed polymer 
solution, which we extruded through a cold 
region, to generate a continuous stable ice fiber, 
which we collected using a motor. By con- 
trolling the extruding speed and the temper- 
ature of the cold source, we could adjust the 
porous structure within the aerogel fiber. Sub- 
sequently, the collected frozen fiber was freeze 
dried to retain the lamellar porous structure 
within the fiber (46-48). Finally, the aerogel 
fiber was coated with TPU solution and dried 
by coating-drying equipment, yielding the en- 
capsulated aerogel fiber with the biomimetic 
core-shell structure. The as-prepared EAF exhibits a core-shell
microstructure similar to that of polar bear hair, as shown by
scanning electron microscopy (SEM) images (Fig. 1, D and E). Through
this facile approach, we realized various EAFs with a wide range of
shell thicknesses and pore sizes at a large scale (Fig. 1F).

Fig. 1. Design and fabrication of encapsulated aerogel fiber.
(A and B) Schematic illustration of thermal insulation principles of
(A) polar bear hairs and (B) our EAFs. (C) EAFs are prepared by
freeze-spinning followed by a coating-drying step. (D and E) SEM
images of (D) a polar bear hair and (E) an EAF. (F) Optical images of
as-prepared fiber reels showing the potential of large-scale
fabrication.


Thermal insulation and mechanical properties 
of EAF 


The encapsulating layer has opposite effects 
on the fiber properties. The thicker layer pro- 
vides stronger fibers yet reduces their thermal 
insulation efficiency (49). Therefore, we opti- 
mized the thickness of the encapsulating layer 
to ensure that the EAF achieved both excellent 
thermal insulation and mechanical strength. 
We show a series of SEM images exhibiting 
fibers coated with TPU at various thicknesses: 
0, ~30, ~80, and ~140 µm (Fig. 2, A to D, re-
spectively). We controlled the shell thickness
by altering the viscosity of the coating solution
and varying the coating times (fig. S1A). We
compared these fibers with various encapsu- 
lating layer thicknesses for their thermal in- 
sulation performance by placing them on the 
same hot stage. We collected a series of IR im-
ages at stage temperatures varying from -20
to 80°C; two typical images, taken
when the stage temperature was -20 and 80°C,
respectively, are shown in fig. S1B. We measured the temperatures
of the fiber surface and the stage on the basis 
of the IR images, which were taken when the 
surface temperature was stable, and the am- 
bient temperature was kept at around 20°C 
during the measurement. We summarize the 
corresponding temperature difference (|ΔT|)
between the stage and the fiber surface in
fig. S1C. As we expected, the thermal insu-
lation performance of the aerogel fiber de-
creased with a thicker encapsulating layer.
The |ΔT| of a single fiber (fig. S1C) is lower
than that of a textile because the thinner fiber 
is more easily affected by its surroundings. The 
aligned pores provide a multireflective effect 
for strong IR reflectance, which greatly re- 
duces thermal radiation (fig. S2). In terms of 
mechanical properties, the aerogel fiber be- 
came highly robust and stretchable after the 
inclusion of the dense encapsulating layer. 
The tensile strength of the fibers increased
from 2.3 to 12.7 MPa as the shell thickness
increased from 0 to 140 µm (Fig. 2E), and
their elongations increased from 24 to 1600%.
Although the core structure of our EAF might 
fracture during stretching, the encapsulating 
layer remained continuous, which ensured the 




integrity of EAF’s structure and the stability of 
its function. The EAF with a diameter of 500 µm
could even lift a 500-g weight (Fig. 2F). We
summarize the variation of tensile strength
and thermal insulation properties with respect to
the encapsulating layer thickness in Fig. 2G.
We clearly show that the addition of the en- 
capsulating layer can substantially increase 
the mechanical performance of the aerogel 
fiber yet maintain thermal insulation proper-
ties. Therefore, we set the encapsulating layer
to around 80 µm for the aerogel fiber 600 µm
in diameter to ensure both excellent ther-
mal insulation and mechanical performance.


Multifunction of EAF 


The encapsulating layer not only gives the
aerogel fiber sufficient strength but also pro-
tects its thermal insulation property
under cyclic stretching. The dense and intact
encapsulating layer allows the EAF to fully
recover after being stretched to even 1000%
strain, with no change in its length (Fig. 3, A and
B). In daily wearing, the fibers are subjected to 
cyclic stretching and radial compression, which 
are difficult requirements for conventional 
aerogel fibers. We investigated the thermal

Fig. 2. Microstructure of
EAFs with controllable
shell thickness and char-
acterization of their
thermal and mechanical
properties. (A to D) Radial
cross-sectional SEM images
showing the porous core
with a diameter of 600 µm
EAF with a tensile load of 500 g, indicating its high strength. Φ, diameter of EAF. (G) As the shell thickness increased, the average tensile strength
increased from 2.3 to 12.7 MPa, and |ΔT| between the fiber surface and the hot stage (~80°C) decreased from 10.8 to 6.2°C. Error bars indicate the standard
deviation from five different samples.


Fig. 3. Thermal insulation
stability, waterproofness, flexi-
bility, and dyeability of EAF.
(A) Schematic diagram of the
stretching and relaxation process
of EAF. (B) Optical images of
EAF when stretched from 0 to
1000% strain. (C) |ΔT| between
the fiber surface and the hot
stage (~40°C) was measured and

insulation performance of the EAF after cyclic
stretching by measuring the equilibrium tem-
perature difference between the fiber surface
and background hot stage (40°C) (Fig. 3C). The
thermal insulation of our EAF was extremely
stable, with |ΔT| steady at around 2.7°C even
after 10,000 stretching cycles at 100% strain,
which indicates that the property is retained even
with daily wear. We attribute this stable ther- 
mal insulation performance to the robust en- 
capsulating layer, which ensures the integrity 
of the fiber. Aside from stretching, our EAF is 
also recoverable from radial compression over 
40% strain (fig. S3). These results indicate that 
by introducing the encapsulating layer, our 
EAF has both thermal insulation and mechan- 
ical robustness. 


Fig. 4. The woven and knitted
aerogel textiles with outstanding
thermal insulating ability.
(A) Optical and IR images
of commercial and EAF textiles
on a stage of 40°C. From left to
right, the types of the textiles are
nylon, PET, wool, and EAF,
respectively. (B) The average |ΔT|
between fiber surface and stage
for the four textiles in (A). Error
bars indicate the standard deviation
from three different samples.
(C) Optical image of a 40 × 25 cm
EAF textile fabricated on a rapier
loom, indicating the potential
of large-scale production. (D) The
enlarged view of the textile in (C),
showing the detail of the woven
structure. (E) Optical image
showing the folded shape of the
flexible textile in (C). (F) Optical
and IR images of ~1-mm-thick
encapsulated aerogel textile and
~5-mm-thick down with the same
thermal insulation performance
at a stage of 40°C. (G) A detailed
comparison of the knitted structure
of EAF and wool, demonstrating
the excellent machinability of EAF.
(H) A volunteer wearing our
EAF textile, down jacket, and wool
and cotton sweaters in a cold
environment of -20°C. The surface
temperatures of these textiles are
measured to be 3.5°, 3.8°, 7.2°,
and 10.8°C, respectively, by averaging
the temperature in the labeled
areas. Black boxes contain side
views of the folded garments. The
EAF sweater and down jacket
differ in thickness by more
than a factor of five. (I) IR image
showing that our EAF sweater
maintained its thermal insulation property at -20°C after washing, indicating its excellent washability.


Additionally, we prepared colorful EAFs by
adding various dyes into the spinning solu-
tions during freeze-spinning (Fig. 3D). Aside
from protection during stretching, the trans-
parent, encapsulating TPU layer effectively pre-
vents the colorful aerogel fibers from fading or
damage by friction (27). With high flexibility
(bending stiffness, 17 to 63 x 10° cN·cm²), our
EAF is knittable, as we demonstrated by mak-
ing a Chinese knot (Fig. 3E). Moreover, the
encapsulating layer also provides our fiber
with excellent waterproofing. Unlike tradi-
tional materials such as silica aerogel, cotton,
and down, which easily lose their thermal
insulation function because water fills
their porous structures in humid or
underwater environments, our EAFs are in-
trinsically waterproof. As we demonstrated
(Fig. 3F), the aerogel fiber made with chito-
san was hydrophilic (fig. S4); it swelled easily
in water within 10 s and shrank with severe
microstructure collapse after drying (fig. S5).
In the fiber volumetric changing test, we used
the ratio of volume after soaking and drying to
the initial volume (V/V0) to evaluate the water-
proofness, where V0 is the initial volume of
the fiber and V is the volume under the corre-
sponding environments. The change in vol-
ume caused by this structural change was
very large. After soaking in water, the unencapsulated
aerogel fiber (AF) expanded to 153% of its original
volume and shrank to 27% after drying
(Fig. 3G). By contrast, our EAF remained intact
both in water and after drying, with no obvious
change of the microstructure inside the fiber
and volume (Fig. 3G and fig. S6).


Thermal insulation textile based on EAF 


On the basis of their excellent flexibility and 
strength, we further wove EAFs into aerogel 
textiles. We compared the textile woven with 
EAFs with nylon, polyester (PET, polyethylene 
terephthalate), and wool textiles of similar 
thickness by placing them on the same hot 
stage (Fig. 4A and table S1). Accordingly, we 
took a series of IR images when the stage tem-
perature was stabilized at 40°C. The higher |ΔT|
of our EAF textile indicated better thermal in-
sulation performance (Fig. 4B). We measured
the thermal conductivity of our EAF textile
using the steady-state method (supplementary
materials) (50). The thermal conductivity of our
textile (26.9 ± 1.8 mW/m·K) was much lower than
that of nylon (91.2 ± 1.6 mW/m·K), PET (98.3 ±
1.9 mW/m·K), and wool (38.9 ± 1.1 mW/m·K)
(table S1). Moreover, we further wove our EAFs 
into a 40 x 25 cm textile using an industrial 
rapier loom (Fig. 4C). This reflects the poten- 
tial of producing scalable thermal insulation 
textiles with our EAFs, which has been a major 
hurdle restricting the development of tradi- 
tional aerogel fibers. An enlarged view of the 
thermal insulation textile (Fig. 4D) shows a 
typical plain-woven structure without defects, 
indicating that our EAFs can resist the shear- 
ing and stretching during weaving. The result- 
ing textile is also flexible enough to be folded 
or bent into various shapes (Fig. 4E and fig. S7), 
which is also crucial for comfortable wearing. 
Down is a common material for warm cloth-
ing because of its excellent thermal insulation.
We compared the thermal insulation property 
of our textile with down by placing them on 
the same hot stage. When they showed similar 
performance in thermal insulation, as indi- 
cated by IR images (Fig. 4F), the EAF textile 
was only around one-fifth as thick as the down. 
Using traditional knitting techniques, we made 
a sweater out of our EAFs with the same knitted 
structure as wool yarn (Fig. 4G). Additionally, 
we tested the thermal insulation property of 
our sweater in a cold environment of -20°C. 
Our biomimetic textile with largely reduced
thickness, as shown in the inset optical images,
shows better thermal insulation efficiency
than traditional down jackets and wool and
cotton sweaters, as indicated by the outer tem-
perature measured from the labeled areas (3.5°,
3.8°, 7.2°, and 10.8°C, respectively) in Fig. 4H.
Additionally, our textile is also washable ow-
ing to the encapsulating layer. After washing
with a commercial washing machine, we did
not observe any obvious change in the appear-
ance (Fig. 4, H and I). More notably, the ther-
mal insulation performance of our sweater
does not degrade, as indicated by the IR image
in Fig. 4I. The outer temperatures before and
after washing were 3.5° and 3.3°C, respectively,
in a -20°C environment. All the data shown
above demonstrate that the textile woven with
our EAFs combines thermal insu-
lation, mechanical strength, flexibility, knitt-
ability, dyeability, and waterproofness, which
are all crucial for the development of next-
generation thermal insulation textiles.
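
The conductivity comparison in this section can be restated numerically. The short sketch below is illustrative only: it uses the mean conductivities quoted in the text (uncertainties omitted), and the function names are hypothetical. The thickness helper assumes pure one-dimensional steady-state conduction, which is a simplification and not necessarily how the textiles were compared experimentally.

```python
# Mean thermal conductivities quoted in the text, in mW/(m*K).
# Uncertainties are deliberately omitted in this sketch.
CONDUCTIVITY = {
    "EAF": 26.9,
    "wool": 38.9,
    "nylon": 91.2,
    "PET": 98.3,
}


def conductivity_ratio(reference: str, other: str) -> float:
    """Return k_other / k_reference for two textiles listed above."""
    return CONDUCTIVITY[other] / CONDUCTIVITY[reference]


def equivalent_thickness(thickness_mm: float, k_from: float, k_to: float) -> float:
    """Thickness of a layer with conductivity k_to that has the same
    conductive resistance (d / k) as a layer of conductivity k_from.

    Assumption: conduction only; convection and radiation are ignored.
    """
    return thickness_mm * k_to / k_from


if __name__ == "__main__":
    for name in ("wool", "nylon", "PET"):
        ratio = conductivity_ratio("EAF", name)
        print(f"{name}: {ratio:.2f} times the EAF textile conductivity")
```

Under the conduction-only assumption, a lower conductivity translates directly into a proportionally thinner layer for the same resistance; real garments also lose heat by convection and radiation, so this is only a first-order comparison.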


Conclusions 


We tackled the long-standing hurdle for the 
practical application of fragile aerogel in ro- 
bust thermal insulation textiles. Drastically 
different from traditional strategies of dop- 
ing or coating aerogel in the textile, we used a 
biomimetic approach in which thermal and 
mechanical properties of the aerogel fiber 
were separately designed. We achieved this by 
developing a relatively strong polymeric aero-
gel fiber with lamellar pores and then encap-
sulating it with a stretchable rubber layer. With
an encapsulating layer only about 80 µm thick,
the tensile strain of our encapsulated aerogel
fiber was increased more than 500-fold
compared with that of traditional aerogel fibers. More
notably, the EAF could endure more than 10,000 
loading-unloading stretching cycles with ex- 
treme stability for thermal insulation, indicat- 
ing its remarkable durability. Because of the 
protection of the encapsulating layer, the EAFs 
are suitable for weaving and knitting, which 
are essential for large-scale preparation. A thin 
sweater made of our biomimetic fiber is only 
around one-fifth as thick as a down jacket for 
the same warmth. Additionally, our textile also 
exhibits excellent flexibility, washability, and 
dyeability, which are all crucial for practical 
applications of the textile. Such textiles feature 
excellent thermal insulation and multifunc-
tionality and have great potential in areas such as
military uniforms and space suits in extremely 
cold environments (57). Beyond this proof-of- 
concept work, our research highlights a different 
strategy for biomimetic design of multifunc- 
tional fibers and textiles, which are required for 
various emerging applications such as wearable 
electronics and personal thermal management.


Genomic evidence for West Antarctic Ice Sheet
collapse during the Last Interglacial


Sally C. Y. Lau, Nerida G. Wilson, Nicholas R. Golledge, Tim R. Naish, Phillip C. Watts,
Catarina N. S. Silva, Ira R. Cooke, A. Louise Allcock, Felix C. Mark,
Katrin Linse, Jan M. Strugnell


The marine-based West Antarctic Ice Sheet (WAIS) is considered vulnerable to irreversible collapse 
under future climate trajectories, and its tipping point may lie within the mitigated warming scenarios of 
1.5° to 2°C of the United Nations Paris Agreement. Knowledge of ice loss during similarly warm past
climates, including the Last Interglacial, when global sea levels were 5 to
10 meters higher than today and global average temperatures were 0.5° to 1.5°C warmer than preindustrial
levels, could resolve this uncertainty. Using a panel of genome-wide, single-nucleotide polymorphisms of a circum-Antarctic octopus,
we show persistent, historic signals of gene flow only possible with complete WAIS collapse. Our results
provide the first empirical evidence that the tipping point of WAIS loss could be reached even under
stringent climate mitigation scenarios.


Climate change continues to cause un-
precedented warming to Earth’s sys-
tem (1). The consequences of warming
are leading to rapid changes in Antarctica,
including Antarctic Ice Sheet mass loss,
with global impacts (1). A major uncertainty in
global mean sea level (GMSL) rise projections 
lies in the potential instability of the West 
Antarctic Ice Sheet (WAIS) (2). The marine-
based WAIS lost 159 ± 8 gigatons of ice
mass per year between 1979 and 2017 (3) and
will continue to be a major contributor to
GMSL rise under all CO₂ emission scenarios (4).
Because future changes and the processes that
influence ice sheet dynamics are poorly understood,
it is unclear whether the WAIS is vulnerable to
rapid ice loss or even full collapse
(2). WAIS collapse could raise global sea level
by ~3.3 to 5 m (5, 6), with direct consequences 
that include human displacement and global 
loss of ecosystems in coastal areas (1). 

It is well understood from geological recon- 
structions that there were interglacial peaks 
(super interglacials) during periods of the 
Pleistocene that experienced warmer temper-
atures (+~0.5° to 1.5°C) and higher GMSL (up
to +10 m) than present (4). These super inter-
glacials include Marine Isotope Stages (MIS) 31,
11, and 5e, which occurred at
~1.08 to 1.05 Ma, ~424 to 395 ka, and ~129 to
116 ka, respectively (4). During MIS 31, the South-
ern Ocean sea surface temperature may have
reached +5° ± 1.2°C above present during summer
months (7). During MIS 11, global mean surface
temperature (GMST) was 0.5° ± 1.6°C and GMSL was
6 to 13 m higher than present; similarly,
during MIS 5e (the Last Interglacial), GMST
was +0.5° to 1.5°C and GMSL was 5 to 10 m higher
than present (4). To date, there is no empirical
evidence to indicate whether the WAIS has 
completely collapsed at any time in the three 
million years since the Pliocene (8, 9). Inferring 
WAIS configurations during late Pleistocene 
super interglacial periods could therefore in-
form estimates of the sensitivity of the Antarctic ice sheet re-
sponse to climate change. So far, analyses of
ice-proximal marine drill core records show
evidence of WAIS retreat during the late Pleis-
tocene interglacials, but the exact timing (10)
and extent (9, 11) of any WAIS collapse remain
ambiguous. Existing ice sheet models have
yielded conflicting WAIS reconstructions dur-
ing these periods, ranging from no collapse (12)
to partial (13) or full collapse (14, 15). Knowledge
of how the WAIS was configured during super
interglacials in the geological past is urgent-
ly needed to constrain future sea level rise
projections (2). Novel approaches such as pop-
ulation genomics can serve as empirical proxies
of past changes to the Antarctic Ice Sheet, de-
tected through signals of historic gene flow
among currently separated populations of ma-
rine organisms (16).

A complete past collapse of the WAIS would 
have opened the trans-West Antarctic seaways 
linking the present-day Weddell Sea (WS), 
Amundsen Sea (AS), and Ross Sea (RS) (16).
Such seaways would have allowed marine
benthic organisms to occupy the opened straits
and disperse across them, thus leaving genetic
signatures of this past connectivity in the ge-
nomes of their descendant, extant populations
(16) (hereafter seaway populations). As the
WAIS reformed, these organisms would have been iso-
lated again within the WS, AS, and RS basins,
with any subsequent connectivity only possible
around the continental margin. Although there
is some support for the existence of trans-Antarctic
seaways based on species assemblage data at
macroevolutionary scales (17-20) or low-resolution
genetic data (21-24), all these studies lack the
power and/or spatial coverage to distinguish
between past dispersal through trans-West
Antarctic seaways and dispersal via contemporary cir-
cumpolar ocean currents. Importantly, these
previous studies cannot be used for accurate 
demographic modeling to identify the likely 
timing of any collapse of the WAIS. 

Collection of benthic species from the South- 
ern Ocean is logistically challenging and re- 
gions such as AS and East Antarctica (EA) are 
difficult to access. Existing samples are typically 
characterized by DNA degradation due to long- 
term storage in collections at room temper- 
ature. We used a target capture approach that 
sequenced genome-wide, single nucleotide poly- 
morphism (SNP) data in the circum-Antarctic 
benthic octopus, Pareledone turqueti, incorporat- 
ing rare samples from AS and EA, collected over 
33 years. Our approach enabled a comprehensive 
sampling strategy to robustly date and test for 
the presence of past trans-West Antarctic sea- 
ways using biological data as proxies. 

We sequenced genome-wide SNPs derived 
from double-digest restriction site-associated 
DNA (ddRAD) (25) loci from 96 P. turqueti 
individuals collected from around the Southern 
Ocean (Fig. 1A). The dataset presents a circum- 
Antarctic overview of the species’ genetic patterns, 
which record the contemporary connectivity 
driven by oceanic currents, mainly the Antarctic
Circumpolar Current (ACC; clockwise) and the
Antarctic Slope Current (ASC; counterclockwise) 
(Fig. 1, A and B), as well as any historical con- 
nectivity that would be associated with past 
trans-West Antarctic seaways. We used a 
reduced SNP dataset (one SNP per locus) to 
analyze population structure, which included 
5188 unlinked SNPs. Complementary analyses 
(Structure, TreeMix) suggest that the population 
genomic variation of P. turqueti is characterized
by geographically structured populations across 
the Southern Ocean (Fig. 1C and figs. S1 and S2). 
This makes it an appropriate species to test 
for the presence of trans-West Antarctic sea- 
ways as signals of historical WAIS collapse in 
highly admixed species would likely be masked 
by contemporary gene flow signatures (6). 
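
As a rough illustration of what reducing a dataset to one SNP per locus involves, the sketch below thins a list of SNPs so that each locus contributes a single site. The data layout (a locus identifier plus a position) and the random choice within each locus are assumptions made for this example; the text does not state how the retained SNP was selected.

```python
import random
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical record layout: (locus_id, position_within_locus).
SNP = Tuple[str, int]


def thin_one_snp_per_locus(snps: List[SNP], seed: int = 0) -> List[SNP]:
    """Keep exactly one SNP per locus, chosen at random (an assumption).

    Loci are returned in sorted order so the result is reproducible
    for a given seed.
    """
    by_locus: Dict[str, List[SNP]] = defaultdict(list)
    for snp in snps:
        by_locus[snp[0]].append(snp)
    rng = random.Random(seed)
    return [rng.choice(by_locus[locus]) for locus in sorted(by_locus)]


if __name__ == "__main__":
    example = [("L1", 5), ("L1", 40), ("L2", 12), ("L3", 7), ("L3", 90)]
    print(thin_one_snp_per_locus(example))
```

Thinning in this way trades the number of sites for approximate independence between them, which clustering methods that assume unlinked markers generally rely on.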
Pareledone turqueti can disperse through ben- 
thic crawling as adults and hatchlings [inferred 
from large egg size (maximum oocyte length 
19.8 mm) and small number of eggs per clutch

Fig. 1. Sample locations of P. turqueti with Structure analyses. (A) Samples
used for analyses of population structure. Abbreviations: Shag Rocks, SR;
South Georgia, SG; South Orkney Is., SOI; Elephant Is., EI; Bransfield Strait, BS;
South Shetland Is., SHE; West Antarctic Peninsula, WAP; South and East Weddell
Sea, S- and E-WS, respectively; Amundsen Sea, AS; Ross Sea, RS; Adélie Land,
AL; East Casey Station, ECS; Prydz Bay, PB. (B) Samples used for admixture
analyses and demographic modeling (collectively demographic inferences) to test
for the existence of trans-West Antarctic seaways. Map includes the
directionalities of the major contemporary circumpolar currents and regional
currents in the Southern Ocean. Black arrows indicate connectivity pathways
through trans-West Antarctic seaways that would result from WAIS collapse.
Direct connectivity between WS-AS or AS-RS would indicate partial WAIS
collapse, and direct connectivity between WS-AS-RS or WS-RS would indicate
complete WAIS collapse. [Photo credit of P. turqueti specimen: E. M. Jorgensen]
(C) Clustering analysis using Structure inferred K = 7 for P. turqueti
(5188-SNP dataset). Each horizontal bar represents an individual sample, bars
are grouped by geographical locations, and colors within each bar correspond to
the proportion of each genetic cluster in the individual.


(n = 22 to 66)] (26, 27). It has also been sug-
gested that long-distance dispersal in P. turqueti
may be achieved, at least occasionally, through
adults or egg masses rafting on floating sub-
strates, or that their benthic egg masses could
become dislodged and disperse through the
currents (23), although no direct evidence yet
supports this.

In P. turqueti, long-distance connectivity link-
ing East and West Antarctica is detected across
the Antarctic continental shelf and Antarctic 
islands (Fig. 1C), likely indicating dispersal that 
reflects contemporary circumpolar currents, 
as found in other Southern Ocean benthic taxa 
(28). If the genomic data of P. turqueti could
be explained only by serial circumpolar colo-
nization around the Antarctic continent, then
observed and expected heterozygosity would de-
crease as geographical distance increases. How-
ever, the observed data do not support this
scenario (fig. S3 and table S1). Admixture is
also observed between individuals from RS and 
AS with some individuals from WS (Fig. 1C), 
indicating a potential signature of trans-West

Fig. 2. Evidence of distinct allele frequency correlations between Amundsen
Sea, Weddell Sea, and Ross Sea, as well as contemporary gene flow in P. turqueti.
Error bars (black horizontal lines) indicate standard errors; filled circles indicate
significant values (Z-score > 3 or < -3); Out, outgroup population, which includes
Shag Rocks and South Georgia (samples combined). Data includes a 120,857-
SNP dataset. Abbreviations: Weddell Sea (WS), South Shetland Islands (SHE),
Amundsen Sea (AS), Ross Sea (RS), East Antarctica (EA). (A) Outgroup-f3-
form of D(Pop, EA, WS, Out), which examines whether there is excess allele
sharing between EA and WS (D<0; ABBA) or Pop and WS (D>0; BABA).


Antarctic seaways. Supporting this concept, 
population differentiation analysis shows lim- 
ited genetic divergence between WS-RS relative 
to other localities that are adjacent to each 
other (fig. S4). 
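
The serial-colonization expectation described above can be checked with a simple calculation: compute expected heterozygosity per population and test whether it declines with distance from a presumed source. The sketch below is a minimal illustration; the function names are hypothetical, the heterozygosity estimator omits any small-sample correction, and any distances or frequencies passed to it would be the user's own inputs, not values from this study.

```python
from statistics import mean
from typing import Sequence


def expected_heterozygosity(allele_freqs: Sequence[float]) -> float:
    """Mean expected heterozygosity, 2p(1 - p), across biallelic SNPs.

    No finite-sample correction is applied in this sketch.
    """
    return mean(2.0 * p * (1.0 - p) for p in allele_freqs)


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5
```

Under serial colonization, `pearson_r(distances, heterozygosities)` would be expected to be clearly negative; a weak or absent correlation is what argues against that scenario.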

Focusing on populations most informative 
of whether the WAIS collapsed in the past, we 
first examined whether there was distinct ad- 
mixture between WS-AS-RS with respect to 
South Shetland Islands (SHE) and East Antarc- 
tica (EA) (including Prydz Bay and East Casey 
Station) samples using 120,857 SNPs (Fig. 1B). 
SHE and EA are known to be influenced by 
both the ACC and ASC (29) but are peripheral 
to the putative historical trans-West Antarctic 
connectivity; as such, these are ideal locations 
that can separate patterns of present-day con- 
nectivity around the WAIS and East Antarctic 
Ice Sheet (EAIS) from persistent, historical sig- 
nals of gene flow. 

We examined allele frequency correlations 
across WS, AS, and RS with respect to SHE 
and EA. The D-statistic (30) explores the pat- 
terns of allele sharing across four populations 
to test for evidence of admixture between 
populations of interest. The outgroup-f3- 
statistic (31) explores the amount of derived 
allele frequency that is shared between pairs 
of populations relative to an outgroup popu-
lation. The presence of admixture linked to
trans-West Antarctic connectivity would result
in high f3 values as well as evidence of excess
allele sharing (D > 0) between WS-AS-RS. In
P. turqueti, the highest f3 values are detected
between AS-RS, followed by RS-EA and RS-
WS (Fig. 2A), thus indicating recent common
ancestry across seaway populations as well as
between RS and EA populations that are
adjacent to each other. When SHE is treated as
the sister lineage to AS/RS and WS [D(AS/RS,
SHE, WS, outgroup)], there is excess allele
sharing between AS/RS and WS (Fig. 2B).
When EA is treated as a sister lineage to
AS/RS and WS [D(AS/RS, EA, WS, outgroup)],
excess allele sharing is also observed between
AS/RS and WS (Fig. 2B). If circumpolar cur-
rents (ASC and ACC) were the only factors that
have influenced gene flow patterns in P. turqueti,
then low f3 values would be observed between
WS-RS as they are situated on opposite sides
of West Antarctica, and excess allele sharing (D)
would also be observed between WS-SHE and
WS-EA as they are geographically adjacent.
However, these results show that in
P. turqueti there are unexpected and signif-
icant allele frequency correlations among AS-
RS-WS, even after accounting for the locations
situated between them around the WAIS (SHE)
and EAIS (EA). Such observed admixture pat-
terns are congruent with historical seaway con-
nectivity in a species that is characterized by
geographically structured populations.
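
To make the two statistics concrete, the sketch below computes them from per-SNP allele frequencies using the commonly used frequency-based definitions. It is a minimal illustration, not the software pipeline used in the study: the function names are hypothetical, no finite-sample bias correction is applied, and standard errors (typically obtained by a block jackknife, from which Z-scores follow) are not computed. The sign convention matches the figure caption: D > 0 indicates excess sharing between the first and third populations (BABA), and D < 0 between the second and third (ABBA).

```python
from typing import Sequence


def d_statistic(p1: Sequence[float], p2: Sequence[float],
                p3: Sequence[float], out: Sequence[float]) -> float:
    """D(P1, P2; P3, Out) from per-SNP allele frequencies.

    Numerator sums (p1 - p2)(p3 - out), i.e. expected BABA minus ABBA;
    denominator sums the expected BABA plus ABBA counts.
    """
    num = 0.0
    den = 0.0
    for a, b, c, o in zip(p1, p2, p3, out):
        num += (a - b) * (c - o)
        den += (a + b - 2.0 * a * b) * (c + o - 2.0 * c * o)
    if den == 0.0:
        raise ValueError("no informative sites")
    return num / den


def outgroup_f3(out: Sequence[float], a: Sequence[float],
                b: Sequence[float]) -> float:
    """Outgroup f3(Out; A, B): mean of (out - a)(out - b) over SNPs.

    Larger values indicate more genetic drift shared by A and B
    since their divergence from the outgroup.
    """
    products = [(o - x) * (o - y) for o, x, y in zip(out, a, b)]
    if not products:
        raise ValueError("no sites supplied")
    return sum(products) / len(products)
```

For example, `d_statistic(freq_AS, freq_SHE, freq_WS, freq_out)` would mirror the D(AS/RS, SHE, WS, outgroup) configuration described above, where the four arguments are hypothetical per-SNP frequency lists for each population.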

A site frequency spectrum (SFS)-based, coales-
cent demographic modeling framework (fastsimcoal)
(32) was used to test the hypothesis
that historical trans-Antarctic seaways
existed, with populations subsequently influ-
enced by contemporary circumpolar gene flow.
For demographic modeling, we included sam-
ples from WS, AS, RS, and EA with 163,335
SNPs in P. turqueti. We employed a hierarchi-
cal approach to test for WAIS collapse scenarios
while incorporating modern circumpolar gene
flow in the models (figs. S5 and S6). Step 1
compared contrasting scenarios of past WAIS
configurations with circumpolar gene flow fol-
lowing the directionality of the ACC (clockwise).
The models incorporated WS, EA, RS, and AS
experiencing continuous circumpolar gene flow
since population divergence. Under these sce-
narios, after population divergence WS, AS, and
RS experienced no, partial, or complete con-
nectivity, followed by modern ACC gene flow
linking WS, EA, RS, and AS. Limited
differentiation was found between competing
scenarios (no, partial, or complete connectivity)
at step 1 (figs. S7 and S8). Therefore, at step 2
model complexity was increased to model
more ecologically realistic scenarios, with cir-
cumpolar gene flow following both direction-
alities of the ACC and ASC (counterclockwise)
for all scenarios (figs. S9 and S10). At step 3,
previously unmodeled ancestral size change was
considered to distinguish competing models
from the previous step.
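
For readers unfamiliar with the data summary that such models are fitted to, a site frequency spectrum is simply a histogram of allele counts across SNPs. The sketch below builds a folded, single-population spectrum from allele counts. It is illustrative only: the function name is hypothetical, and the study's models are fitted to spectra across several populations, whose exact construction (folded or unfolded, joint or pairwise) is not specified in this section.

```python
from typing import List, Sequence


def folded_sfs(allele_counts: Sequence[int], n_chromosomes: int) -> List[int]:
    """Folded site frequency spectrum for one population.

    allele_counts[i] is the number of copies of one (arbitrary) allele
    at SNP i among n_chromosomes sampled chromosomes. Entry k of the
    result is the number of SNPs whose minor allele occurs k times.
    """
    sfs = [0] * (n_chromosomes // 2 + 1)
    for count in allele_counts:
        if not 0 <= count <= n_chromosomes:
            raise ValueError(f"allele count {count} out of range")
        minor = min(count, n_chromosomes - count)
        sfs[minor] += 1
    return sfs


if __name__ == "__main__":
    # Five SNPs typed in two diploid individuals (four chromosomes).
    print(folded_sfs([1, 3, 2, 0, 4], n_chromosomes=4))
```

Demographic events such as bottlenecks, expansions, and migration shift the shape of this histogram in characteristic ways, which is what allows competing scenarios to be compared by likelihood.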

The observed SFSs were statistically best ex- 
plained by the scenario of a complete histori- 
cal WAIS collapse (Fig. 3A and figs. S11 and 
S12), followed by modern circumpolar gene
flow linked to ACC and ASC. The ancestral
lineage of WS, AS, RS, and EA populations ex-
perienced an expansion followed by a bottleneck
(Fig. 3A and table S2). During the mid-Pliocene,
which ended ~3 million years ago [95% confi- 
dence interval (CI) between 3.6 and 3.5 million 
years ago (Mya)], the best model supports WS, 
AS, RS, and EA continental shelf locations 
splitting into four populations with direct 
asymmetric gene flow detected between WS- 
AS-RS. This suggests that ancient seaways were 
likely opened across the WAIS, which directly 
linked the present-day WS, AS, and RS, and 
could only be facilitated by WAIS collapse 
during past interglacials. The
historical, direct WS-AS-RS connectivity began
~3.6 to 3.0 Mya, which supports geological
evidence of historical WAIS collapses during
the Pliocene (8, 9). The signature of direct WS-
AS-RS connectivity ceased between 139 and 54 ka
(based on 95% CI; Fig. 3A), which is in accord-
ance with the end of the Last Interglacial.
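
The dating argument amounts to an interval-overlap check between the 95% CI for the end of direct connectivity and the super interglacial periods quoted in the text. The sketch below encodes only that comparison; the constant and function names are hypothetical, and it says nothing about how the confidence interval itself was estimated.

```python
from typing import Tuple

# Intervals in thousands of years ago (ka), as quoted in the text.
Interval = Tuple[float, float]

CONNECTIVITY_END_CI: Interval = (139.0, 54.0)  # 95% CI, end of WS-AS-RS gene flow
MIS_5E: Interval = (129.0, 116.0)              # Last Interglacial
MIS_11: Interval = (424.0, 395.0)              # previous super interglacial


def overlaps(a: Interval, b: Interval) -> bool:
    """True if two time intervals share at least one point.

    Bounds may be given in either order (older first or younger first).
    """
    a_old, a_young = max(a), min(a)
    b_old, b_young = max(b), min(b)
    return a_young <= b_old and b_young <= a_old


if __name__ == "__main__":
    print("Overlaps MIS 5e:", overlaps(CONNECTIVITY_END_CI, MIS_5E))
    print("Overlaps MIS 11:", overlaps(CONNECTIVITY_END_CI, MIS_11))
```

The interval for the end of connectivity contains MIS 5e entirely and lies far from MIS 11, which is the basis for attributing the signal to the Last Interglacial.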

Considering the estimated confidence inter-
vals, the cessation of direct gene flow between
WS-AS-RS could only be associated with the most
recent interglacial, MIS 5e (129 to 116 ka), be-
cause the prior super interglacial (MIS 11; ~424
to 395 ka), which would also enable such di-
rect connectivity, is far outside of the upper
uncertainty bound of the connectivity. The
maximum likelihood estimate for the end of direct
WS-AS-RS connectivity was 87 ka; the
time lag between 87 ka and the time of MIS
5e (129 to 116 ka) can be explained at least in
part by the time it takes for complete trans-
West Antarctic migration to influence allele
frequencies in a benthic, direct-developing oc-
topus. Finally, contemporary circumpolar gene
flow began after 87 ka, following the direc-
tions of the ACC and ASC, which generally
reflects the influence of ongoing circumpolar
currents on gene flow of Southern Ocean ben-
thic taxa, in particular the ACC (28). For exam-
ple, the high migration rates observed from
WS to EA and from RS to AS reflect the strength of
the ACC relative to the ASC. To support the logic that
populations were split from the same ancestral
population, we considered the alternative tree
topologies of diverse three-population models
involving WS, RS and EA, or SHE (figs. S13 and
S14). In all cases, recent, shared ancestry between
RS and WS is not an alternative explanation 
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Fig. 3. Best-supported demographic model for P. turqueti indicating that signatures of complete 
historical WAIS collapse began during the mid-Pliocene and continued until the end of MIS 5e, 
supplemented by a StairwayPlot that indicates past changes in population size. (A) Maximum 
likelihood model for P. turqueti including the Amundsen Sea (AS), Ross Sea (RS), Weddell Sea (WS) and East 
Antarctica (EA) populations, as well as direct historical gene flow (3 Mya to 87 ka) between WS, AS, and 
RS and modern gene flow (87 to 0 ka) around the Antarctic continental shelf following the directionality of
the Antarctic Circumpolar Current (ACC; clockwise), and the Antarctic Slope Current (ASC; counter-
clockwise). Maximized parameter estimates are visualized. The associated 95% confidence intervals (CI)
are in brackets and reported in Table S2. Time of the events modeled are shown on the left. The width of the 
bars is proportional to the effective population size of the population. Arrows indicate migration (forward in 
time), with the width of the arrows proportional to the number of migrants per generation (2Nm). The 
migration pathways, based on modeled migration directions (forward in time), are also visualized on a map of 
Antarctica. Map shows subglacial bed elevation >0 m above present-day sea level and is extracted from
Bedmap2 (40). Data includes a 163,335-SNP dataset. (B) StairwayPlot reconstruction of past changes in
effective population size over time in P. turqueti in the last ~4 Mya. Line indicates median and shaded
area indicates 2.5 and 97.5% confidence limits. Data includes a 191,024-SNP dataset. Dashed vertical lines
represent timing of MIS 5e (~125 ka), MIS 11 (~424 ka), and MIS 31 (~1.08 Mya). Abbreviations: 

Holocene, Hol. 
for the observed data (tables S3 and S4), with
consistent outcomes supporting historic direct 
gene flow between WS-RS ending around the 
Last Interglacial. The implication is that the con- 
clusion of WAIS collapse is robust to the model 
assumption of a split from the same ancestral 
population. 
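
The interval reasoning used above to attribute the end of direct WS-AS-RS gene flow to MIS 5e rather than MIS 11 can be sketched as a simple overlap test. This is only an illustration of the logic, not part of the demographic inference itself; the `Interval` type and `overlaps` helper are hypothetical names, and the bounds are the values quoted in the text.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    """A time span in thousands of years before present (ka)."""
    older: float    # older bound, in ka
    younger: float  # younger bound, in ka


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if two ka intervals share at least one point in time."""
    return a.younger <= b.older and b.younger <= a.older


# Values quoted in the text.
CESSATION_CI = Interval(older=139, younger=54)  # 95% CI, end of direct gene flow
MIS_5E = Interval(older=129, younger=116)       # Last Interglacial
MIS_11 = Interval(older=424, younger=395)       # prior super interglacial

for name, stage in (("MIS 5e", MIS_5E), ("MIS 11", MIS_11)):
    print(f"{name}: compatible with cessation CI = {overlaps(CESSATION_CI, stage)}")
```

Under these bounds only MIS 5e falls within the confidence interval, which is the basis for excluding MIS 11.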

One of the biggest challenges of inferring
demographic events in the late Pleistocene is
determining whether the species experienced a severe
bottleneck (i.e., sharp reduction in Ne) in the
recent past that eroded genomic history. For
example, if the WAIS had collapsed in the late 
Pleistocene, large areas of newly ice-free habi- 
tats (where the WAIS existed previously) would 
have become available for benthic fauna to 
disperse and colonize. During the subsequent 
glacial maximum, as the AIS expanded across 
the Antarctic continental shelf, the marine shelf 
habitats would likely be reduced to small, 
isolated pockets of in situ ice-free refugia (33). 
Such changes in habitat availability would 
likely lead to severe population bottlenecks 
and subsequent genetic drift (33). As a result, a 
recent severe bottleneck could lead to loss of 
alleles and increased coalescence rate and 
events only dating back to the time of most 
recent bottleneck (34) (i.e., LGM at ~20 ka in 
P. turqueti). Demographic models detected 
events prior to LGM in P. turqueti, suggesting 
this species did not experience population bottle- 
necks in the recent past as severe as might be 
expected for a Southern Ocean benthic species. 

We searched for signals of population size 
fluctuation prior to the LGM in P. turqueti 
using StairwayPlot (35), an SFS-based model- 
free method. We found that the demographic 
changes dated by StairwayPlot generally cor- 
respond with the dating of gene flow changes by 
fastsimcoal. Pronounced demographic changes 
(observed from Fig. 3B) were detected in the RS 
and WS populations at ~435 ka and ~200 ka, 
respectively. These timings coincide with the 
glacial stage MIS 12 and the relatively cool in- 
terglacial of MIS 7, both of which were subse- 
quently followed by periods of peak warmth 
during MIS 11 and MIS 5e respectively. In spe- 
cies with population structure, population de- 
cline at a particular timepoint in StairwayPlot 
corresponds to signals of demographic change 
such as changes in population structure and 
migration rate (36). In the case of P. turqueti, 
the population decline detected in RS around 
MIS 11 and in WS around MIS 5e corresponds 
to demographic change likely associated with 
WAIS collapse. In the fastsimcoal model (Fig. 
3A), the population decline detected across all 
populations after the Last Interglacial also cor- 
responds to the widely accepted hypothesis that 
there would be limited in situ ice-free refugia 
on the Antarctic continental shelf during the 
LGM, leading to population bottlenecks in ben- 
thic species that could only survive on the shelf 
(33) (i.e., the case for P. turqueti) (23). These 
patterns suggest that recent super interglacials
and the LGM likely strongly influenced species 
demography, particularly in populations asso- 
ciated with the signatures of WAIS collapse. 
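
For readers unfamiliar with the input to SFS-based methods such as StairwayPlot, the following minimal sketch shows how a folded site frequency spectrum can be tallied from diploid genotypes. It is an illustration under stated assumptions, not the authors' pipeline: the `folded_sfs` function and the toy genotype matrix are hypothetical, missing data are not handled, and whether the study used a folded or unfolded spectrum is not stated in this section.

```python
from collections import Counter
from typing import List, Sequence


def folded_sfs(genotypes: Sequence[Sequence[int]]) -> List[int]:
    """Folded SFS from a sites x individuals matrix of diploid genotypes.

    Each genotype is the count of alternate alleles (0, 1, or 2).
    Entry k of the result is the number of sites whose minor allele
    appears k times in the sample; index 0 counts monomorphic sites.
    """
    if not genotypes:
        return []
    n_chrom = 2 * len(genotypes[0])      # diploid: two chromosomes per individual
    counts: Counter = Counter()
    for site in genotypes:
        alt = sum(site)                  # alternate-allele count at this site
        minor = min(alt, n_chrom - alt)  # fold: ancestral state is treated as unknown
        counts[minor] += 1
    return [counts.get(k, 0) for k in range(n_chrom // 2 + 1)]


# Toy data (illustrative only): 5 SNPs genotyped in 3 individuals.
toy = [
    [0, 1, 0],  # minor count 1
    [1, 1, 0],  # minor count 2
    [2, 2, 1],  # alt count 5 of 6, so minor count 1
    [0, 0, 1],  # minor count 1
    [1, 1, 1],  # minor count 3
]
print(folded_sfs(toy))  # [0, 3, 1, 1]
```

Methods of this kind infer changes in effective population size from the shape of such a spectrum, which is why a severe recent bottleneck that removes rare alleles would erase older signal.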

Estimation of past changes in population 
size with StairwayPlot indicated that those 
in the AS and EA were relatively stable since 
3 Mya whereas fastsimcoal indicated size changes 
in these populations; such discordances are likely 
due to method-specific sensitivity in popula- 
tions with low sample size in regions where it 
is challenging to collect biological samples (n <
5 in AS and EA). Overall, the timing of demo-
graphic events detected by independent infer-
ences also corroborates the timings of major
glacial-interglacial fluctuations in the Pliocene 
and the late Pleistocene, as well as events 
dated using independent markers (mitochon- 
drial data) (23). Therefore, the dating of WAIS 
collapse, as seen through the genomic data of 
P. turqueti appears to be robust and uncon- 
founded by noise. 

Our demographic modeling approach was 
specifically designed to test whether trans-West 
Antarctic seaways existed in the past that could 
be detected with simple contrasting models. 
The best-supported demographic model was 
able to characterize an overview of the his- 
torical, direct WS-AS-RS connectivity linked 
to WAIS collapses through the Pliocene and 
as late as MIS 5e. Although additional periods 
of seaway closure occurred during other geo- 
logical periods between 3 Mya and MIS 5e, 
our analyses are focused on dating the most 
recent gene flow linked to historical WAIS 
collapse. The evolutionary history of P. turqueti 
is highly complex and populations would have 
experienced specific demographic changes 
associated with each glacial-interglacial cycle 
throughout the Quaternary. We did not se- 
quentially reconstruct their past changes in 
population size and connectivity patterns to 
avoid over-parameterization. Signatures of 
recent events are more clearly encoded in allele 
frequencies. We utilized a target capture approach 
that can sequence reduced representation 
genomic data in samples with DNA degrada- 
tion. Our reduced representation SNP dataset 
has the power to reflect recent major events 
dated to around MIS 5e, MIS 11, and the 
Pliocene. Regardless of the overall challenge of 
demographic modeling for Southern Ocean 
species, signatures of a complete WAIS col- 
lapse, last detected at around the MIS 5e, 
were clear in P. turqueti. 

We provide empirical evidence indicating 
that the genomic signatures of marine-based 
WAIS sector collapse were present during the
Last Interglacial (MIS 5e), when GMST was
0.5° to 1.5°C warmer than the preindustrial.
Future WAIS collapse on centennial timescales 
is considered to be a low likelihood process (4); 
however, in recent trajectories estimated for 
temperature rise, such as for the most optimistic 


emission scenario Shared Socio-economic Path- 
way (SSP) 1-1.9, air temperature is projected to 
reach +1.2° to 1.7°C by 2100 (very likely range) (4).
Moreover, Antarctic Ice Sheet models simu- 
lating the response to Intergovernmental Panel 
on Climate Change (IPCC) emissions scenarios 
(15, 37) show that a threshold is crossed when 
warming is above SSP2-2.6 (the Paris climate 
target) whereby ice shelves are lost and marine- 
based sectors of WAIS undergo self-reinforcing 
melting due to marine ice sheet instability pro- 
cesses. This study provides empirical evidence 
indicating that the WAIS collapsed when the 
global mean temperature was similar to that of 
today, suggesting that the tipping point of fu- 
ture WAIS collapse is close. Future global sea 
level rise projections should consider the irre- 
versible collapse of the WAIS and some marine 
sectors of the EAIS (38), which will commit the 
planet to multimeter GMSL rise over the coming
centuries and millennia if global warming
exceeds +1.5° to 2°C above preindustrial levels 
, 8, 39). 
GluD1 binds GABA and controls inhibitory plasticity


Laura Piot, Christina Heroven†‡, Simon Bossi†, Joseph Zamith, Tomas Malinauskas,
Chris Johnson, Doris Wennagel, David Stroebel, Cécile Charrier, A. Radu Aricescu*,
Laetitia Mony*, Pierre Paoletti*


Fast synaptic neurotransmission in the vertebrate central nervous system relies primarily on ionotropic 
glutamate receptors (iGluRs), which drive neuronal excitation, and type A γ-aminobutyric acid receptors
(GABAARs), which are responsible for neuronal inhibition. However, the GluD1 receptor, an iGluR family
member, is present at both excitatory and inhibitory synapses. Whether and how GluD1 activation may 
affect inhibitory neurotransmission is unknown. In this work, by using a combination of biochemical, 
structural, and functional analyses, we demonstrate that GluD1 binds GABA, a previously unknown 
feature of iGluRs. GluD1 activation produces long-lasting enhancement of GABAergic synaptic currents in 
the adult mouse hippocampus through a non-ionotropic mechanism that is dependent on trans-synaptic 
anchoring. The identification of GluD1 as a GABA receptor that controls inhibitory synaptic plasticity 
challenges the classical dichotomy between glutamatergic and GABAergic receptors. 


¹Institut de Biologie de l'ENS (IBENS), Ecole Normale
Supérieure, Université PSL, CNRS, INSERM, F-75005 Paris,
France. ²MRC Laboratory of Molecular Biology, Cambridge
CB2 0QH, UK. ³Division of Structural Biology, Wellcome
Centre for Human Genetics, University of Oxford, Oxford
OX3 7BN, UK.

*Corresponding author. Email: radu@mrc-lmb.cam.ac.uk (A.R.A.);
laetitia.mony@ens.psl.eu (L.M.); pierre.paoletti@ens.psl.eu (P.P.)
†These authors contributed equally to this work.
‡Present address: CAMS Oxford Institute, University of Oxford, Oxford
OX3 7BN, UK.


Most excitatory neurotransmission in the
central nervous system is mediated
by the ionotropic glutamate receptor
(iGluR) family members, which are
classified as AMPA (GluA1 to 4), N-
methyl-D-aspartate (NMDA) (GluN1, GluN2A
to D, and GluN3A and B), kainate (GluK1 to 5),
and delta (GluD1 and D2) receptors (1). At a
typical excitatory synapse, glutamate released
in the synaptic cleft binds postsynaptic iGluRs
and drives opening of their cation-conductive
ion channel, which results in membrane de-
polarization. Although structurally related (2),
GluD1 and GluD2 receptors depart from other 
iGluRs by their insensitivity to glutamate and 
inability to gate the ion channel pore upon 
ligand binding (3, 4). Instead, GluD receptors 
are thought to exert their physiological func-
tion by binding D-serine (or glycine) (5-7) and
through metabotropic (i.e., non-ionotropic) 
signaling (4, 8) [although ionotropic GluD sig- 
naling that is dependent on G protein-coupled 
receptors has also been suggested (9)]. This 
mechanism is best characterized for GluD2 
receptors. Postsynaptic GluD2 binds cerebellin-1, 
a “molecular bridge” attached to presynaptic 
membrane proteins from the neurexin family, 
to form tripartite complexes that span the syn- 
aptic cleft and are involved in synapse forma- 
tion, maintenance, and plasticity (4, 10-13). 
Although GluD1 can also form triads with 
cerebellins and neurexins (8, 14), relatively 
little is known about GluD1 signaling and its 
functional roles. Expressed throughout the 


forebrain (15-17), GluD1 receptors participate
in excitatory synaptogenesis (18, 19) and reg- 
ulate the balance between AMPA and NMDA 
receptors (20). GluD1 also specifies the forma- 
tion of inhibitory synapses in the neocortex 
(21-24). 


Results 
GluD1 receptors bind GABA 


Intrigued by the presence of GluD1 receptors 
at inhibitory postsynaptic sites (23, 24), which 
is unparalleled among iGluRs, we investigated 
the ligand-binding profile and activation of 
GluD1 receptors. Because agonist binding does 
not elicit currents in delta receptors, we first 
introduced in GluD1 the “Lurcher” A654T 
(Ala654→Thr) mutation, which renders GluD2
receptors constitutively active (25, 26). After 
the expression of GluD1-A654T mutant recep- 
tors into Xenopus oocytes, small but reliable 
constitutive inward currents were observed 
(fig. S1, A and B). Application of D-serine or
glycine potentiated these currents (fig. S1, B
and C), which contrasts with the current in-
hibition observed on GluD2-Lurcher receptors
(fig. S2, A to C) (6, 27). Because GluD1-A654T
constitutive currents were small, we inserted
an additional mutation, C645I (Cys645→Ile), to
match the equivalent GluD2 M3 pore segment
(fig. S1D). This double mutant GluD1-A654T-
C645I, named GluD1-Lurcher hereafter, yielded
larger constitutive currents (fig. S1E) that were
also potentiated by D-serine and glycine (Fig. 1,
A to C) while displaying almost no sensitivity to
glutamate (Fig. 1C and fig. S3, A to C). Agonist
dose-response curves on GluD1-Lurcher revealed
D-serine potency in the high micromolar range
[median effective concentration (EC50) of 320 ±
10 μM (n = 13 to 17 cells); Fig. 1C and table S1],
similar to that measured on GluD2-Lurcher
receptors [median inhibitory concentration
(IC50) of 260 ± 10 μM (n = 5 or 6 cells); fig. S2,
A to C, and (5, 27, 28)]. Glycine sensitivity
appeared rightward shifted, in the low milli-
molar range [EC50 of 3.4 ± 0.2 mM (n = 9
cells); Fig. 1C and table S1], reminiscent of
the glycine sensitivity of GluD2-Lurcher recep-
tors [IC50 = 0.75 ± 0.05 mM (n = 10 cells); fig.
S2C]. We then tested the ability of GluD1 re-
ceptors to sense γ-aminobutyric acid (GABA).
GABA robustly potentiated GluD1-Lurcher cur-
rents with an apparent affinity in the low milli-
molar range [EC50 = 3.0 ± 0.2 mM (n = 15 to 17
cells)], lower than that of D-serine but sim-
ilar to that of glycine, and with equal efficacy
(Fig. 1, B and C; see also fig. S1, B and C, and
table S1). This GABA effect was displaced by
D-serine (fig. S3, D to F). By contrast, GABA was
essentially ineffective on GluD2-Lurcher recep-
tors, producing modest current inhibition and
limited D-serine displacement only at high con-
centrations (fig. S2, D to F). Mutating conserved
iGluR agonist-binding residues (1) in the GluD1
ligand-binding domain (LBD) fully abolished
responses to GABA (figs. S2, G to I, and S3, G
to I), revealing the primary role of the GluD1
LBD in GABA sensing. Application of GABA
(10 mM) onto NMDA (fig. S4, A to D) and AMPA
(fig. S4, E to H) receptors triggered negligible
effects, which is consistent with insensitivity
to GABA.

Fig. 1. GABA binds and triggers conformational changes at GluD1 receptors. (A) Schematic of the two-electrode voltage clamp technique performed on
Xenopus oocytes that express GluD1-Lurcher receptors. TMD, transmembrane domain. (B) D-serine and GABA potentiate GluD1-Lurcher constitutive currents.
The pore blocker pentamidine (Penta, 100 μM) was used to estimate the amount of constitutive current. Dashed lines represent the constitutive Lurcher currents.
(C) Dose-response curves for GABA, D-serine, glycine, and glutamate on GluD1-Lurcher receptors. Values and sample numbers are provided in table S1. Error
bars indicate SD. (D) Schematic of VCF performed on Xenopus oocytes that express GluD1*-E525C labeled with Alexa488-maleimide. GluD1* is the receptor
containing the C721S mutation (Cys721→Ser). (E) VCF signals of GluD1*-E525C and GluD1*-E525C-R526K-D742A receptors upon application of D-serine, glycine,
GABA, and glutamate (10 mM each). R526K, Arg526→Lys; D742A, Asp742→Ala. (F) ΔF amplitudes triggered by 10-mM ligand application on oocytes that express
either GluD1*-E525C or GluD1*-E525C-R526K-D742A receptors and on non-injected oocytes. The center line represents the median, box limits are upper and
lower quartiles, and whiskers are minimum and maximum values. Values and sample numbers are listed in table S5. ***P < 0.001, and n.s. is not significant
(Dunn's post hoc test). Detailed replicate numbers and the full statistical analysis are provided in table S6.

We next aimed to confirm these observations
in GluD1 receptors devoid of the Lurcher muta-
tions. We used voltage clamp fluorometry (VCF),
a technique that enables detection of ligand (or
voltage)-induced conformational changes in
membrane proteins (29). We designed a series
of GluD1 receptors with cysteines engineered
at various positions in the LBDs for attachment
of the environment-sensitive fluorescent probe
Alexa488 and screened for potential VCF fluo-
rescence changes (ΔFs) upon ligand addition
(Fig. 1D). Several positions yielded robust ΔFs
upon D-serine (or glycine) application. Posi-
tion E525 (E, Glu) in the structurally sensitive
D1-D1 dimer interface (27) produced particu-
larly strong signals (table S2). We verified that
the E525C (Glu525→Cys) mutation and the flu-
orescent probe attachment had minimal ef-
fects on the properties of the receptor (fig.
S5). Application of GABA on labeled GluD1-
E525C receptors also triggered marked ΔFs,
which revealed the ability of GABA to bind and
elicit conformational changes at (non-Lurcher)
GluD1 receptors (Fig. 1, D to F, and fig. S6, A to
C). Glutamate application had hardly any effect


(Fig. 1, E and F). No fluorescent signals were 
detected upon D-serine, glycine, or GABA ad-
dition onto binding-deficient GluD1 receptors
(Fig. 1, E and F, and fig. S6, D to F), which
demonstrates that ligand binding at the GluD1
LBD is the critical initial molecular event.
VCF-based dose-response curves showed agonist
apparent affinities in the sub- to low-millimolar
range, with GABA sensitivity similar to that of
glycine but lower than that of D-serine (fig. S6, B
and C), which thus corroborates results ob-
tained on GluD1 Lurcher receptors. Thermal
shift assay and isothermal titration calorimetry
(ITC) experiments on isolated LBDs confirmed
direct binding of GABA (and D-serine, but not
glutamate) to the GluD1 LBD with a binding
affinity of 2 mM (Fig. 2, A and B; fig. S7, A and
B; and table S3), which is consistent with our
results on full-length GluD1 receptors. Con-
versely, GluD2 LBDs bound D-serine but not
GABA (fig. S7, C to F). Therefore, GABA ap- 
pears to be an active ligand at GluD1, but not 
GluD2, receptors. 
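
To give a feel for what the reported potencies imply, the sketch below evaluates a standard Hill equation at two agonist concentrations using the EC50 values quoted above for GluD1-Lurcher currents. It is an illustration only: the Hill coefficient of 1 is an assumption (fitted coefficients are in table S1, which is not reproduced here), and `hill_response` is a hypothetical helper name.

```python
def hill_response(conc_mM: float, ec50_mM: float, hill: float = 1.0) -> float:
    """Fraction of the maximal effect at a given agonist concentration."""
    if conc_mM < 0 or ec50_mM <= 0:
        raise ValueError("concentration must be >= 0 and EC50 must be > 0")
    return conc_mM**hill / (conc_mM**hill + ec50_mM**hill)


# EC50 values reported in the text, converted to mM.
EC50_mM = {"D-serine": 0.32, "GABA": 3.0, "glycine": 3.4}

for ligand, ec50 in EC50_mM.items():
    at_1mM = hill_response(1.0, ec50)    # assumed Hill coefficient of 1
    at_10mM = hill_response(10.0, ec50)
    print(f"{ligand:9s} 1 mM: {at_1mM:.2f}   10 mM: {at_10mM:.2f}")
```

With these assumptions, GABA and glycine produce similar fractional responses at any given concentration, while D-serine approaches saturation at much lower concentrations, consistent with the ordering of potencies described above.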


Structure of the GABA binding site 


To understand how GluD1 interacts with GABA, 
we determined the crystal structure of the GluD1 
Fig. 2. Structures of the GABA-GluD1 and D-serine-GluD1 LBD complexes. (A and B) ITC analysis of D-serine (A) and GABA (B) binding to the isolated GluD1 LBD;
values are listed in table S3. KD, dissociation constant. (C) X-ray crystal structures of the GluD1 LBD in complex with GABA (left, blue) and D-serine (right, cyan)
superimposed with the apo GluD1 LBD structure (tan). The extent of GluD1 LBD clamshell closure upon ligand binding is indicated. (D) Close-up view of the GABA
binding site, which shows the 2mFo-DFc electron density map contoured at 1.0σ. Distances (in Å) between interacting atoms are indicated. (E and F) LigPlot diagrams
of the GluD1-GABA (E) and GluD1-D-serine (F) complexes. CA, α-carbon atom.
bottom. Q405E, Gln405→Glu; Q536Y, Gln536→Tyr. (E) Mutations Q405E and Q536Y in the GluN1 LBD confer GABA sensitivity to GluN1/GluN2A NMDA receptors.
Current traces elicited by successive applications of glutamate plus glycine, GABA alone, and glutamate plus GABA are shown. (F) GABA and glycine (coagonist) dose-
response curves (in 1 mM glutamate) of GluN1-Q405E-Q536Y/GluN2A and wild-type (WT) GluN1/GluN2A receptors. The latter receptors are insensitive to GABA (10 mM). Error
bars indicate SD. Values and sample numbers are provided in table S1.
Fig. 4. GluD1 receptors mediate GABAergic synaptic plasticity. (A) A schematic of in utero electroporation (IUE) for cell-specific labeling and manipulation of GluD1
expression in the mouse hippocampus performed at embryonic day 15.5 (E15.5) is shown on the left. A confocal image of the hippocampus of a P21 mouse
electroporated with tdTomato that shows selective targeting of CA1 pyramidal neurons is shown on the right. Imaging and whole-cell patch-clamp recordings were
performed at CA1 synapses in the SLM, a region enriched in GluD1 receptors (15-17). Scale bar, 0.5 mm. (B) GluD1 preferentially associates with inhibitory SLM
synapses. Colocalization images are shown on the left. The white arrowheads highlight the association between GluD1 and the indicated synaptic markers. A
quantification is shown on the right. The center line represents the median, box limits are upper and lower quartiles, and whiskers are minimum and maximum
values. For gephyrin (GPHN), n = 20 dendrites, and for postsynaptic density protein 95 (PSD-95), n = 18 dendrites; ***P < 0.001 (two-sided unpaired Student's t test).
Scale bars, 2 μm. (C) HFS potentiates IPSCs in control neurons but not in neurons [...] results on HFS-induced IPSC amplitude. The control group is nonelectroporated
neurons from the same electroporated mouse. (J) HFS-induced inhibitory plasticity is present in serine racemase knockout (SR KO) mice (n = 8 cells)
(left). A quantification of HFS-induced plasticity in SR KO mice is shown on

LBD-GABA complex using x-ray crystallography 
(Fig. 2, C and D; fig. S8, A and B; and table S4). 
The GluD1 LBD-GABA complex formed well-
ordered crystals that diffracted to 1.9-Å reso-
lution, which allowed for the unambiguous
identification of GABA electron density (Fig.
2D). GABA binds the central LBD interlobe
cleft and, typical of iGluR agonists (1, 30), forms
multiple interactions with both the upper and 
lower lobes (Fig. 2, D and E). In particular, the 
guanidinium group of GluD1-R526 (R, Arg) 
engages in a bidentate salt bridge with the 
carboxyl group of GABA, whereas the y-amino 
group forms a dual electrostatic interaction 
with the carboxylates of GluD1-E446 (upper 
lobe) and GluD1-D742 (lower lobe) (D, Asp). 
Disrupting these interactions prevented GABA 
action on GluD1 receptors (fig. S3, G to I). Thus,
charged interactions that include the strictly
conserved bond between the arginine side chain
of LBD helix D and the ligand α-carboxylate
(1, 30) are major anchors for GABA binding
on GluD1. In addition, three aromatic tyrosine 
(Y) residues line the GABA binding pocket, 
including GluD1-Y492, which acts as a lid on 
top of the GABA molecule (Fig. 2, D and E). 
For comparison, we also solved the structure 
of the GluD1 LBD in the apo state and in the 
presence of D-serine (Fig. 2C; fig. S8, C to H;
and table S4). The apo GluD1 LBD crystal form 
contained three dimers in the asymmetric 
unit and displayed open cleft conformations 
(Fig. 2C and fig. S9, A to C), which is reminis-
cent of other apo iGluR LBD structures (1, 30).
By contrast, the GABA-bound GluD1 LBD 
crystallized as an asymmetric dimer, in which 
LBDs adopt closed-cleft conformations, albeit 
the closure angles differed between the two 
protomers (17.3° and 10.5°, respectively, rel-
ative to chain E of the apo state; Fig. 2C and
fig. S8, A to C). The GluD1 LBD-D-serine com-
plex crystal diffracted to 1.63-Å resolution
and contained one homodimer per asymmetric
unit (table S4). Only one monomer had D-serine
bound, and it adopted a closed-cleft confor- 
mation identical to that observed with GABA 
(closure of 17.3° relative to chain E of the apo 
state; fig. S8, D to F). The D-serine interactions
with GluD1 LBD lobes again involve both ionic
and hydrogen bonding (Fig. 2F and fig. S8F).
Overall, the ligand-binding pose is notably sim-
ilar to that described for D-serine bound to the
GluD2 receptor LBD (5, 31) (fig. S8).


Molecular determinants of GABA selectivity 


To examine the molecular basis of GluD1 GABA 
binding and selectivity, we performed structure- 
based mutagenesis experiments. Substituting 
the LBD upper lobe residue E446 to Ser (S), 
Gln (Q), or Asn (N) abolished GABA effects on
GluD1-Lurcher currents, but D-serine remained
active (Fig. 3, A to C, and fig. S10). The more
conservative E446D (Glu446→Asp) mutant was
not as discriminating and retained sensitivity
to both ligands, although with reduced affinity
(Fig. 3C, fig. S10, and table S1). These results
are consistent with E446 making ionic inter-
actions with GABA but not D-serine, which thus
identifies E446 as pivotal for GABA recognition 
by GluD1. The importance of this residue as a key 
GABA anchor was reinforced by gain-of-function 
experiments performed on (GABA-insensitive) 
NMDA receptors. In the glycine-binding GluN1 
subunit, the equivalent residue of GluD1-E446 
is a glutamine (Q405) (Fig. 3D and fig. S11). 
Introduction of the single-point mutation Q405E 
(Gln405→Glu) into the GluN1 LBD allowed direct
activation of GluN1/GluN2A NMDA receptors 
by coapplication of glutamate and GABA (in 
replacement of glycine; fig. S12, A to C). The 
gating efficacy of GABA at NMDA receptors 
was further enhanced by introducing a second 
mutation in GluN1 [mutant GluN1-Q405E-
Q536Y (Gln536→Tyr)] to better mimic the GluD1
GABA binding environment (Fig. 3, D and E,
and fig. S12C). On GluN1-Q405E-Q536Y/GluN2A
receptors, the efficacy of GABA as a coagonist 
reached >50% of that of glycine (Fig. 3F), which 
points to E446 and Y539 as critical determi- 
nants of GABA recognition in GluD1 receptors. 
E446 and Y539 are conserved in GluD2 recep- 
tors (fig. S11), which suggests that other GluD1 
determinants contribute to GABA selectivity 
among delta receptors. 


GABA binding to GluD1 controls inhibitory 
synaptic strength 


We then assessed the impact of GluD1 activation 
on synaptic transmission at mature GABAergic 
synapses. We focused on the stratum lacunosum- 
moleculare (SLM) of the mouse hippocampus, 
a region highly enriched in GluD1 receptors 
(15-17) (Fig. 4A). Confocal microscopy imaging 
revealed that GluD1 preferentially accumu- 
lates at inhibitory synapses rather than at 
excitatory synapses (Fig. 4B), as previously ob- 
served in distal dendrites of cortical pyramidal 
neurons (23, 24). We recorded GABAergic in- 
hibitory postsynaptic currents (IPSCs) from CA1 
pyramidal cells while performing electrical stim- 
ulations in the SLM (fig. S13A). Application of 
the GluD1 agonist D-serine potentiated the IPSC
amplitudes [EC50 of 180 ± 0.05 μM (n = 5 to
16 cells); fig. S13, B to D]. The continuous inclu-
sion of D-aminophosphovalerate (D-APV) in these
experiments ruled out a direct activation of NMDA
receptors by D-serine as an underlying mech-
anism. Application of 5,7-dichlorokynurenic


the right. Error bars indicate SEM. Detailed replicate numbers and the full
statistical analysis are provided in table S6. For (C) to (I), *P < 0.05, **P <
0.01, ***P < 0.0001, and n.s. is not significant (two-sided Mann-Whitney test);
error bars indicate SEM.


acid (DCKA), a ligand that, like D-serine, binds
iGluR glycine sites (7) and potentiates GluD1- 
Lurcher receptors (fig. S14), also produced 
robust enhancement of IPSC amplitude (fig. 
S13E). Short hairpin RNA (shRNA)-based 
down-regulation of GluD1 expression following 
hippocampus-targeted in utero electropora-
tion eliminated the D-serine- and DCKA-induced
potentiation (fig. S13, B and E). It furthermore 
decreased the density of SLM inhibitory syn- 
apses (fig. S13F), as previously shown in the 
cortex (23). This indicates both developmen- 
tal and acute effects of GluD1 signaling on 
GABAergic transmission at SLM CA1 syn-
apses. We next induced short bursts of high-
frequency stimulation (HFS) to promote the
release of endogenous GABA from presynaptic 
inhibitory terminals (32). After HFS, IPSC am- 
plitude increased by ~30% in control neurons 
but not in neurons in which GluD1 was knocked 
down, an effect that persisted throughout the 
recording (15 min; Fig. 4C) and likely involved 
a postsynaptic mechanism (fig. S13G). A mo- 
lecular replacement strategy with mutant GluD1 
receptors deficient in distinct signaling modali- 
ties (23) revealed that ligand binding and inter- 
action with the extracellular scaffolding protein 
cerebellin, but not ion flux through GluD1 chan- 
nels, are required for GluD1 to mediate enhance- 
ment of inhibitory neurotransmission (Fig. 
4, D to I, and fig. S13H). Preventing binding of 
GABA, but not of D-serine or glycine, with the
GluD1-E446S (Glu446→Ser) mutant abolished
HFS-induced plasticity (Fig. 4, H and I), whereas
inhibitory plasticity was conserved in mice de-
ficient in D-serine signaling (33) (Fig. 4J). Thus,
activation of GluD1 receptors at SLM CA1 hippo- 
campal GABAergic synapses induces prolonged 
potentiation of inhibitory synaptic strength 
through a non-ionotropic GABA-induced mech- 
anism that relies on trans-synaptic anchoring. 


Discussion 


Our study unveils that, in the SLM of the mouse 
hippocampus, GluD1 receptors operate at GABAergic 
synapses as receptors that bind GABA and 
control inhibitory synaptic plasticity. We pro- 
pose that synaptically released GABA, together 
with cerebellin, binds to and triggers conforma- 
tional changes of postsynaptic GluD1 receptors, 
which activates intracellular signaling path- 
ways, leading to an increase in the number and/ 
or activity of type A GABA receptors (GABAARs).
This GluD1-driven mechanism of inhibitory 
plasticity, which is embedded in the inhibitory 
synapse itself, departs from previously des- 
cribed mechanisms of inhibitory plasticity that 
commonly rely on cross-talk with neighboring 
excitatory (glutamatergic) synapses (34). By 
contrast, the non-ionotropic, trans-synaptic, and
ligand-induced mechanism echoes the GluD2-
induced plasticity at glutamatergic parallel
fiber-Purkinje cell synapses in the cerebellum,
where GluD2 activation by extracellular D-serine
regulates AMPA receptor content (4, 6, 35, 36). 
The high (10 mM) concentrations of neuro- 
transmitter reached at fast chemical synapses 
after vesicular release (37-39) indicate substan- 
tial activation of GluD1 receptors by GABA 
despite their relatively low affinity for this 
agonist. The detailed mechanisms that couple 
GluD1 activation to the modulation of GABAAR
activity, however, remain to be deciphered. The 
question of whether this previously unknown 
form of GABAergic synaptic plasticity interacts 
with other forms of inhibitory plasticity (34) 
also remains open. Because GluD1 receptors 
are broadly expressed in the forebrain (15-17), 
including at inhibitory synapses in the neo- 
cortex (23), GABA signaling through GluD1 
receptors likely represents a general mechanism 
that extends the computational rules of inhib- 
itory plasticity with consequences on neuronal 
circuit function. 
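
The occupancy argument above can be made concrete with a back-of-envelope estimate under a simple one-site equilibrium binding model. This is an illustrative assumption rather than a calculation from the study: it combines the binding affinity of about 2 mM reported for the isolated GluD1 LBD with the 10 mM peak transmitter concentration quoted above, and it ignores the brevity of the synaptic transient, so it should be read as an upper-bound sketch.

```latex
\theta = \frac{[\mathrm{GABA}]}{[\mathrm{GABA}] + K_D}
       = \frac{10\ \mathrm{mM}}{10\ \mathrm{mM} + 2\ \mathrm{mM}}
       \approx 0.83
```

Under these assumptions, most GluD1 binding sites would be occupied at the peak of release despite the millimolar affinity.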

In humans, GluD1 mutations are associated 
with susceptibility to autism and schizophrenia 
(4, 40) as well as major depressive disorders 
(41). With their dual ability to reside at excitatory
and inhibitory synapses, GluD1 receptors are not 
only specially equipped to act as powerful reg- 
ulators of synaptic circuits but are also vulnerable 
nodes of excitation-inhibition imbalance during 
neuropsychiatric disorders. In that context, our 
finding that GluD1 receptors are molecular 
machines with hybrid features—functionally 
GABAergic but structurally glutamatergic— 
opens fresh perspectives on GluD1-targeted 
neuropharmacology. 
SOCIAL DISPARITIES


Highlighting health consequences of racial disparities sparks support for action


Riana M. Brown, Pia Dietze, Maureen A. Craig


Racial disparities arise across many vital areas of American life, including employment, health, and 
interpersonal treatment. For example, one in three Black children lives in poverty (versus one in nine white 
children), and, on average, Black Americans live four fewer years compared with white Americans. Which 
disparity is more likely to spark reduction efforts? We find that highlighting disparities in health-related 
(versus economic) outcomes spurs greater social media engagement and support for disparity-mitigating 
policy. Further, reading about racial health disparities elicits greater support for action (e.g., protesting) 
compared with economic- or belonging-based disparities. This occurs in part because people view health 
disparities as violating morally sacred values, which enhances perceived injustice. This work elucidates 
which manifestations of racial inequality are most likely to prompt Americans to action. 


Movements pursuing social change are
often sparked and sustained by spe- 
cific events reaching mainstream con- 
sciousness. But which types of events 
are most likely to garner attention and 
support for social change? In 2014, for exam- 
ple, the majority-Black city of Flint, Michigan, 
came into the national spotlight when citizens 
confronted officials over lead leaching in the 
city’s water supply (1). As another example,
the Black Lives Matter (BLM) movement was
sparked by the killing of Trayvon Martin, and
BLM protests often rekindle after subsequent
lives are lost. Although brain damage from
lead exposure, disproportionate use of force, 
and deaths understandably draw attention, 
the systemic nature of racial inequality in the 
United States means that discrimination and 
disparities facing Black Americans are not 
siloed to threats to life and limb or, more gen- 
erally, the domain of physical health (2). For 
example, Flint could have instead captured 
America’s attention because of its high poverty 
rates that eventually made it the most im- 
poverished US city (3); yet, it was notably a 
health-related event that sparked outrage. In 
this work, we empirically investigate how and 
why racial disparities in specific domains— 
economics (e.g., employment), health (e.g., ac- 
cess to hospitals), and feelings of belonging (e.g., 
interpersonal treatment)—differentially influ- 
ence support for mitigating action. 

People support action against situations that 
they perceive as unjust or unfair (4-6). This has 
been found across a variety of social issues, 
including inequality based on race (7-9), gen- 
der (10), and social class (11, 12). Whereas past 
work has documented that a wide array of is- 
sues can elicit injustice perceptions and support 
for action, no empirical work (to our knowledge) 
has tested the relative mobilizing potency of 
different domains within the same group-based 
inequality. Our research provides this vital test 
to assess how and why exposure to different 
manifestations of racial inequality may specif- 
ically spark engagement to mitigate this press- 
ing social issue. 

Disparities between social groups occur in 
a variety of important life domains. Our work 
focuses on racial inequalities in domains that 
social scientists regard as vital to well-being 
and life fulfillment: health, economics, and 
belonging (13-15). Health-related disparities 
occur when social groups differ in their ex- 
posure to violence or other forms of physical 
dangers, disease, or injury or when physiolog- 
ical needs are insufficiently met (16). For ex- 
ample, Black American communities have less 
access to nutritious and healthy foods (17),
have fewer hospitals and pharmacies (18-20), 
and experience more physical violence by po- 
lice [racial disparities in police violence are now 
considered and addressed as a public health 
issue (21)]. Economic disparities, by contrast, 
reflect group differences in access to wealth, 
income, jobs, and resources (22). As examples 
of economic-based racial inequality, Black 
Americans, on average, attend lower-funded 
schools, which leads to lower future salaries 
(23); experience greater difficulty acquiring 
jobs and promotions (24, 25); and have less 
accrued family wealth and lower homeownership
rates (18, 26). Finally, belonging-based
disparities involve group differences in ex- 
periencing positive or negative interpersonal 
treatment (27). For instance, Black Americans, 
on average, face more discipline and suspensions
in schools (28, 29), experience alienation
with the police (30), and feel unwelcome
and treated with suspicion in shops and public
areas (31, 32). This research investigates
the extent to which Americans support efforts
to reduce racial disparities in these three
life domains.

Fig. 1. Raincloud plot with jittered data for participants’ general action support for different disparity
domains (economic condition, health condition, and belonging condition) in study 1b. Error bars represent
95% CIs around the means (large black dots). Smaller dots indicate individual participant scores. The “cloud”
areas reflect the data distributions.

Fig. 2. The health-based and economic-based ads used in study 2. Facebook users (N = 24,952) were
randomly assigned to view one of the two ads (health-based ad or economic-based ad) embedded in
their news feed, marked as sponsored content.
[Ad text, health version: “Health-based disparities between Black and White Americans involve
experiencing greater physical danger or being denied access to physiological needs.”
Headline: “Health-Based Disparities Persist in America.”]
[Ad text, economic version: “Economic-based disparities between Black and White Americans
involve experiencing greater financial insecurity or being denied access to financial means.”
Headline: “Economic-Based Disparities Persist in America.”]

Fig. 3. Study 3 serial process model of the indirect effect of disparity condition (health versus
economic) on policy support through moral sacredness affecting perceived injustice (with 10,000
bootstrap samples). The value in parentheses represents the total effect before the inclusion of the
mediators. *P < 0.05; **P < 0.01; ***P < 0.001.

We posit that health disparities may be viewed
as particularly pernicious because issues related
to health and medical care (broadly, the
protection of human life) are considered
sacred moral values (33-38). Sacred moral 
values have “infinite or transcendental signifi- 
cance that precludes comparisons [and] trade- 
offs” [(38), p. 853]. In addition, they are often 
considered fundamental or core values (39). In 
support of this notion, in Maslow’s classic 
hierarchy and past empirical work, health- 
relevant physiological needs are considered to 
be more fundamental compared with belonging- 
based or economic needs (13, 14, 40, 41). 

When something is sacred, it is treated as 
morally imperative to protect relative to non- 
sacred, secular concerns, such as economic 
issues [e.g., access to money (33, 37-39, 42)]. 
When a sacred moral value is violated, peo- 
ple report feeling outraged at the moral vio- 
lation, seek to punish the violator, and act 
against the violation (38). Thus, the more fun- 
damental need for health should be perceived 
as more morally sacred compared with less 
fundamental needs of economics and be- 
longing and, as such, should be perceived 
as more unjust if violated. In turn, we predict 
that, on average, people will seek to alleviate 
health disparities over economic or belong- 
ing disparities. 

We tested this hypothesis in four studies, 
using diverse methodologies and including 
three mostly white samples (study 1a, study
4a, and study 4b), a targeted sample with
equal numbers of white and Black Americans 
(study 1b), a sample of social media users (study 
2), and a nationally representative sample (study 
3). We operationalize support for action against 
inequality in several different ways: support 
for individually enacted collective action (e.g., 
attending protests; study 1), institutional 
change (i.e., policy support; studies 3 and 4), 
and actual social media engagement (study 2). 
All study materials are provided in the sup- 
plementary materials, and all data and syntax 
are provided at Dryad (53). All studies followed
ethical guidelines and were Institutional Review
Board (IRB) approved.


Study 1 


In a within-subjects experimental design, two 
samples (study 1a: N = 191 subjects, 73% white;
study 1b, preregistered at https://aspredicted.org/qf39d.pdf:
N = 337, 50% white, 50% Black)
(43) of US citizens read nine examples of racial
disparities, three each from the domains of 
health (e.g., “Black American communities 
have less access to hospitals, pharmacies, and 
medical care compared with white American 
communities”), economics (e.g., “Black American 
communities experience lower homeowner- 
ship rates, leading to less wealth compared 
with white Americans”), and belonging (e.g., 
“Black Americans feel less welcome in shops 
and public spaces compared with white Am- 
ericans”) (see the supplementary materials for 
all items). After each example, participants re- 
ported whether the disparity was unjust or fair 
[reverse-coded; 2 items; r values within each domain
ranged from 0.62 to 0.64 (study 1a) and
0.83 to 0.85 (study 1b)]. Participants also indicated
their agreement (from 1, indicating strongly dis- 
agree, to 7, indicating strongly agree) that they 
“personally want to take action to reduce the 
given disparity” (general action). In study 1b, 
participants additionally indicated how likely 
(from 1, extremely unlikely, to 7, extremely likely) 
they were to engage in specific actions: donat- 
ing, engaging in a protest, and sharing infor- 
mation on social media to reduce the disparity 
(concrete actions, 3 items; α values within each
domain ranged from 0.82 to 0.87). 
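
The two-item injustice measure pairs an "unjust" rating with a reverse-coded "fair" rating. The following is only a minimal sketch of how such a composite is commonly computed. It assumes both items use the same 1-to-7 response range as the action items, which is not stated above, and the function names are hypothetical rather than taken from the authors' syntax.

```python
# Assumption: both injustice items are rated on a 1-7 scale.
SCALE_MIN, SCALE_MAX = 1, 7


def reverse_code(score: int) -> int:
    """Flip a rating so that a high 'fair' rating becomes a low injustice score."""
    if not SCALE_MIN <= score <= SCALE_MAX:
        raise ValueError(f"score {score} is outside {SCALE_MIN}-{SCALE_MAX}")
    return SCALE_MIN + SCALE_MAX - score


def injustice_composite(unjust: int, fair: int) -> float:
    """Average the 'unjust' rating with the reverse-coded 'fair' rating."""
    return (unjust + reverse_code(fair)) / 2


# A participant who rates a disparity 6 on "unjust" and 2 on "fair":
print(injustice_composite(unjust=6, fair=2))  # 6.0
```

Reverse coding puts both items on the same direction, so a higher composite always means the disparity is perceived as more unjust.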


Results of study 1 


To assess the effects of domain (study 1a and
study 1b) and participant race (study 1b), we
conducted multilevel models with disparity
domain (study 1a and study 1b) and participant
race (study 1b) as fixed effects and example
type and participant as random effects.
As predicted and preregistered (study 1b),
both samples (Fig. 1) revealed significant ef- 
fects of disparity domain on support for action 
[study 1a: F(2, 1524) = 8.10, P < 0.001; study 1b,
general action: F(2, 2690) = 40.52, P < 0.001;
study 1b, concrete actions: F(2, 2690) = 55.85,
P < 0.001]. Participants reported greater support
for addressing disparities in health compared
with either economics [study 1a: β = 0.08,
t(1524.02) = 3.18, P = 0.001; study 1b, general
action: β = 0.17, t(2692.00) = 7.64, P < 0.001;
study 1b, concrete actions: β = 0.12, t(2692.00) =
7.08, P < 0.001] or belonging [study 1a: β = 0.10,
t(1524.02) = 3.77, P < 0.001; study 1b, general
action: β = 0.17, t(2692.00) = 7.98, P < 0.001;
study 1b, concrete actions: β = 0.18, t(2692.00) =
10.32, P < 0.001]. There were no differences in
support for general action between the economic
and belonging disparities (β values < 0.02, P >
0.510); however, participants reported greater
support for concrete actions (e.g., protesting) to
reduce economic- compared with belonging-
based disparities [study 1b: β = 0.06, t(2692.00) =
3.24, P = 0.001].

In study 1b, we tested for moderation by participant
group membership (white versus Black
Americans), but only the disparity domain main
effect and a main effect of participant race consistently
emerged: Black Americans overall supported
addressing all disparities more than white
Americans, both in general [F(1, 335) = 12.39, P <
0.001] and for concrete actions [e.g., protests;
F(1, 335.01) = 15.73, P < 0.001]. Overall, these
results reveal that both white and Black Americans
expressed more support for reducing health dis- 
parities compared with other disparities (see 
the supplementary materials for descriptive 
statistics and additional analyses). 

As preregistered (study 1b), greater support
for action to reduce health (compared with either 
economic or belonging) disparities was mediated 
by perceptions of injustice [study 1b: indirect 
effects > 0.02; 95% confidence intervals (CIs), 
(0.010, 0.048)] (see the supplementary materials 
for comprehensive results). Thus, across two ways 
of operationalizing support for action and among 
participants belonging to both socially ad- 
vantaged and disadvantaged racial groups, 
study 1 provides consistent evidence that white 
and Black Americans support reducing health- 
based racial disparities more than economics- and 
belonging-based disparities. This heightened 
support for action is, in part, because of percep- 
tions that health disparities are more unjust. 


Study 2 


To generalize the findings of study 1 to an 
ecologically valid context, study 2 (preregis- 
tered at https://aspredicted.org/g6dd4.pdf) 
experimentally tested which disparity domain 
garners more engagement on social media. 
Given that the health and economic disparities 
elicited the most and second-most support 
for mitigation in study 1, we directly compare
these domains for a conservative test. We cre- 
ated two versions of a sponsored advertisement 
on Facebook [see (12) for similar method- 
ology]. Each ad presented a headline about 
racial inequality—one about health, the other 
about economic disparities—followed by a brief 
description of the issue (Fig. 2). We preregis- 
tered our hypothesis that the health disparities 
ad would garner more social media engagement 
compared with the economic disparities ad. 


Results of study 2 


We ran a split test (randomized experiment) 
to assess which ad was more engaging over 
24 hours [see (12) for details on this metric].
As predicted, the health ad “won”: It was more 
engaged with, reached more users, and was 
more cost-effective. The health ad reached 
1156 (9.7%) more people compared with the 
economic ad and had a 95% chance of out- 
performing the economic ad in a replication. 
This finding reveals that even abstract infor- 
mation about racial health disparities—versus 
the specific examples used in study 1—elicits 
greater engagement with the issue in an eco- 
logically valid setting. 
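
The reported reach figures can be cross-checked with simple arithmetic. The sketch below assumes that the N = 24,952 Facebook users reported for study 2 is the combined reach of the two ads and that the 9.7% difference is expressed relative to the economic ad. Neither assumption is stated explicitly in the text, and the variable names are illustrative.

```python
TOTAL_REACH = 24_952   # Facebook users reported for study 2
EXTRA_HEALTH = 1_156   # additional users reached by the health ad

# Assumption: the reach of the two ads sums to TOTAL_REACH.
economic_reach = (TOTAL_REACH - EXTRA_HEALTH) // 2
health_reach = economic_reach + EXTRA_HEALTH

# Assumption: the percentage is relative to the economic ad's reach.
relative_gain = EXTRA_HEALTH / economic_reach

print(economic_reach, health_reach)  # 11898 13054
print(f"{relative_gain:.1%}")        # 9.7%
```

Under these assumptions the implied split (11,898 versus 13,054 users) reproduces the reported 9.7% difference.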


Study 3 


For study 3 (preregistered at https://aspredicted.org/j5rc9.pdf),
we conceptually replicate studies
1 and 2 using a new outcome (policy sup- 
port), and we measure a possible precursory 
mechanism—perceived violations of moral 
sacredness—which is hypothesized to spur 
injustice perceptions. A nationally represen- 
tative sample of 1550 US residents (44) viewed 
a short infographic highlighting health-, 
economic-, or belonging-based racial dispari- 
ties. Then, all participants reported on their 
agreement (from 1, strongly disagree, to 7, 
strongly agree) of how unjust (“these disparities 
are unjust”) and morally sacred (2 items, r = 0.48;
e.g., “these racial disparities involve issues or
values that should never be violated”) the issue
was as well as their support for general policies
to reduce it (2 items, r = 0.77; e.g., “politicians
need to prioritize creating policies that reduce 
these racial disparities”) (see the supplemen- 
tary materials for all items). 


Results of study 3 


An analysis of variance (ANOVA) (see the sup- 
plementary materials for descriptive statistics) 
revealed a marginal main effect of disparity
domain on policy support [F(2, 1547) = 2.75,
P = 0.064, ηp² = 0.01]. As predicted, planned
contrasts revealed that participants exposed
to health disparities supported mitigating policies
more than those exposed to economic
disparities [β = 0.14, t(1547) = 2.32, P = 0.020].
There were no differences in policy support between
the economic and belonging conditions
(β = -0.09, P = 0.150) nor, unexpectedly, between
the belonging and health conditions (β =
0.05, P = 0.378). Additionally, we found support
for the preregistered serial mediation model: 
Highlighting health (versus economic) dispar- 
ities elicited support for mitigating policies as 
a result of perceptions that the issue is morally 
sacred, which, in turn, induced perceived in- 
justice [95% CI, (0.05, 0.13); Fig. 3]. As such, 
with a nationally representative sample, study 
3 finds support that health-based disparities 
(versus economic disparities) are seen as a violation
of moral sacredness and are thus unjust,
which evokes more policy support. 
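
The serial mediation logic (condition, then moral sacredness, then perceived injustice, then policy support) can be illustrated with a percentile bootstrap of the product of the three paths. The sketch below is not the authors' analysis syntax. It runs on synthetic data with made-up effect sizes, uses fewer resamples than the 10,000 reported for the sake of speed, and all function names are hypothetical.

```python
import random


def ols(rows, y):
    """Least-squares coefficients (intercept first) via the normal equations."""
    X = [[1.0] + list(r) for r in rows]
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(x[i] * x[j] for x in X) for j in range(k)]
         + [sum(x[i] * yi for x, yi in zip(X, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][k] / A[i][i] for i in range(k)]


def serial_indirect(data):
    """Product of paths a * d * b for the chain X -> M1 -> M2 -> Y."""
    x, m1, m2, y = zip(*data)
    a = ols([(xi,) for xi in x], m1)[1]      # X -> M1
    d = ols(list(zip(x, m1)), m2)[2]         # M1 -> M2, controlling for X
    b = ols(list(zip(x, m1, m2)), y)[3]      # M2 -> Y, controlling for X and M1
    return a * d * b


def bootstrap_ci(data, reps=300, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the serial indirect effect."""
    rng = random.Random(seed)
    n = len(data)
    est = sorted(
        serial_indirect([data[rng.randrange(n)] for _ in range(n)])
        for _ in range(reps)
    )
    return est[int(reps * alpha / 2)], est[int(reps * (1 - alpha / 2)) - 1]


# Synthetic data only: x = condition (1 = health, 0 = economic),
# m1 = moral sacredness, m2 = perceived injustice, y = policy support.
gen = random.Random(1)
data = []
for _ in range(300):
    x = gen.randint(0, 1)
    m1 = 0.8 * x + gen.gauss(0, 1)
    m2 = 0.6 * m1 + gen.gauss(0, 1)
    y = 0.5 * m2 + 0.1 * x + gen.gauss(0, 1)
    data.append((x, m1, m2, y))

low, high = bootstrap_ci(data)
print(f"indirect = {serial_indirect(data):.3f}, 95% CI [{low:.3f}, {high:.3f}]")
```

An interval that excludes zero is the usual criterion for concluding that the indirect path is reliably different from zero, which is how the reported 95% CI of (0.05, 0.13) is read.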


Study 4 


Although we find greater support for the mit- 
igation of health-based racial disparities com- 
pared with economic disparities (studies 1 to 
3), it is possible that people reveal this pref- 
erence because they view the solutions to ad- 
dress health-based disparities as less costly 
than the solutions to address economic dis- 
parities. Because we assessed general policy 
support in study 3, participants considering 
health disparities (versus economic dispar- 
ities) might have conceived of different policies 
to address each issue. For example, solutions 
to combat racial health disparities, such as 
increasing access to healthcare in specific 
communities, might be perceived as less costly 
for the advantaged group and therefore less 
zero-sum; by contrast, frequently discussed 
solutions to address racial economic dispar- 
ities, such as redistributing financial resources, 
may seem costly. We aimed to address this 
alternative explanation in study 4. Specifi- 
cally, we tested whether our result regarding 
policy support—that people show greater sup- 
port for policy solutions aimed to address racial 
health (versus economic) inequality—replicates 
when assessing support for a concrete policy 
that is consistent across conditions and is costly 
for everyone. 

Additionally, we aimed to extend the po- 
tential applicability of our findings in study 4. 
The social determinants of health approach, 
endorsed by the Centers for Disease Control 
and Prevention, the World Health Organiza- 
tion (45), and other public health agencies (45), 
contends that fiscal economic policy is key 
to the mitigation of health inequality (46). As 
such, we assess support for a specific fiscal 
policy in study 4. 

In two samples, study 4a (N = 490, 76% white)
and study 4b (preregistered at https://aspredicted.org/qc7jv.pdf;
N = 1088, 74% white), US citizens
indicated their support (from 1, strongly
oppose, to 7, strongly support) for the same 
fiscal federal policy—increasing taxes by 0.5% 
for all Americans. Crucially, participants were 
randomly assigned to one of two conditions in 
which they either read that this policy would 
help to reduce health-based or economic-based 
racial disparities. 


Results of study 4 

Across samples, as predicted and preregistered 
(study 4b), participants who read that a 0.5% 
tax increase for all Americans would go to re- 
ducing health-based disparities supported 
the increase more than those informed that 
the same tax increase would go to reducing 
economic-based disparities [study 4a: t(488) = 
2.35, P = 0.019, d = 0.21; study 4b: t(1086) = 
4.02, P < 0.001, d = 0.24] (47). This finding 
reveals that Americans express relatively more 
support for increasing taxes if the tax increase 
is aimed at mitigating health (versus economic) 
inequality, even if the proposed policy is held 
constant across conditions, is very concrete 
(versus general policy support in study 3), and 
affects all Americans. 
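
The reported effect sizes can be approximately recovered from the t statistics. The sketch below uses the standard conversion d = 2t / sqrt(df) for two independent groups. It assumes roughly equal group sizes, which the text does not report, and the function name is illustrative.

```python
from math import sqrt


def cohens_d_from_t(t: float, df: int) -> float:
    """Approximate Cohen's d for two independent groups of equal size."""
    return 2 * t / sqrt(df)


print(round(cohens_d_from_t(2.35, 488), 2))   # 0.21 (study 4a)
print(round(cohens_d_from_t(4.02, 1086), 2))  # 0.24 (study 4b)
```

Both values match the reported d = 0.21 and d = 0.24, which is consistent with near-equal allocation to the two conditions.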


Discussion 


Across four studies (with six experiments), 
exposure to health disparities spurred greater 
support for taking action compared with economic
(studies 1 to 4) or belonging-based disparities
(studies 1a and 1b). Health disparities
also garnered more social media engagement 
(study 2), support for general policies (study 3), 
and support for a specific fiscal policy (study 4) 
compared with economic disparities. These ef- 
fects appear driven, at least in part, by percep- 
tions that racial disparities in health outcomes 
violate a sacred moral value (study 3) and are 
more unjust (studies 1 and 3). 

Notably, domains of social inequality are 
clearly highly interrelated; disparities in one 
domain may coincide with or even generate 
disparities in other domains. Consider the issue 
of underfunded public schools in majority- 
Black US neighborhoods: An economic issue— 
lack of public funding—contributes to schools 
providing nutrient-deficient lunches dispro- 
portionately to Black children. Despite their 
interconnected nature, our experiments dem- 
onstrate that highlighting the health conse- 
quences, in particular, will likely garner more 
support to address the issue than if the eco- 
nomic precursor is made salient. Even when 
the solution is economic or fiscal in nature (i.e., 
tax increases), people report more support for 
a policy that addresses health compared with 
economic issues. 

This work provides evidence of the relative 
potency of health-related disparity informa- 
tion; however, this information is not a panacea. 
Research reveals that white Americans are less 
likely to support action to mitigate a health issue 
(e.g., COVID-19) if they are informed that the 
health issue exacerbates racial disparities com- 
pared with if racial disparities are not men- 
tioned [(48); also see (49)]. Consistent with this 
work, participants’ average levels of support in 
our studies suggest hesitance in taking con- 
crete actions or enacting specific policies to re- 
duce racial inequality (participants, on average, 
were neutral or in opposition to concrete action 
and policy). Nevertheless, this research reveals
that racial health disparities are among the most 
likely to garner support and that other race- 
based disparities are even less likely to mobilize 
Americans. 

We also found a different pattern of results 
regarding comparisons between health and 
belonging disparities. In study 1, participants 
supported addressing health disparities more 
than belonging disparities, whereas study 3 did 
not find this. Although speculative, it could 
be that the change in operationalizations of 
support (individual action versus policy sup- 
port) or experimental materials (mentioning 
“basic interpersonal needs”) may account for 
this discrepancy. 

The present research also has limitations 
that should be addressed in future inquiries. 
First, the current experiments focus exclusively 
on disparities between Black and white Am- 
ericans. Although we would expect that our 
results would replicate for different dimen- 
sions of inequality (e.g., gender or class), dif- 
ferent disparity domains (e.g., misapplications 
of the law, disparities in political representa- 
tion), as well as disparities between other racial 
groups (e.g., white and Hispanic Americans), 
future research is needed to test this. Additional- 
ly, we did not target other racial minority groups 
(e.g., Asian Americans or Hispanic Americans) 
in recruitment. Thus, we cannot conclude how 
other racial minorities react to Black-white 
disparities. Future work can examine how dif- 
ferent disparities may help or hinder intra- 
minority coalitions. 

Second, although we focus on sacred moral 
values as a mediator, we do not consider this 
the only mechanism. It is likely that additional 
mechanisms contribute to responses to health 
(versus economic or belonging) disparities. 
For example, people may place more blame 
on victims of economic outcomes compared 
with health outcomes because of meritocratic 
worldviews (50). Alternatively, people may per- 
ceive health outcomes as more severe or urgent 
compared with outcomes in other inequality 
domains. Future studies should delve into ad- 
ditional possible mechanisms. 

Overall, the present research represents a 
large-scale investigation of which forms of ra- 
cial disparities spur support for action to re- 
duce racial inequality (and one explanation as 
to why). Research across social science consid- 
ers how activists and leaders of social move- 
ments galvanize involvement by tactically 
choosing how to present or frame an issue (51).
Thus, the results of the present research could 
be informative for policy-makers, leaders of 
social movements, and citizens seeking to 
reduce racial inequality. Given that the United 
Nations pledged to reduce inequality within 
and among countries by 2030 and called for 
individuals to join the fight (52), this work can 
help us understand why violations of physical 
needs and safety can be powerful determinants
of civilian movements for change. 


REFERENCES AND NOTES 


1. M. Smith, J. Bosman, M. Davery, “Flint’s water crisis started 
5 years ago. It’s not over,” New York Times, 25 April 2019;
https://www.nytimes.com/2019/04/25/us/flint-water-crisis.html. 

2. S.N. Bleich et al., Health Serv. Res. 54, 1399-1408 (2019). 

3. J. Mack, “Flint and Detroit among nation’s top 5 poorest cities, 
new census data shows,” MLive, 26 September 2019; 
https://www.mlive.com/news/2019/09/flint-and-detroit-among- 
nations-top-5-poorest-cities-new-census-data-shows.html. 

4. J. Jetten, A. Iyer, N. R. Branscombe, A. Zhang, Eur. Rev. Soc.
Psychol. 24, 194-224 (2013). 

5. M. van Zomeren, Soc. Personal. Psychol. Compass 7, 378-388 
(2013). 

6. M. van Zomeren, T. Postmes, R. Spears, Psychol. Bull. 134, 
504-535 (2008). 

7. H. P. Selvanathan, P. Techakesari, L. R. Tropp, F. K. Barlow, 
Group Process. Intergroup Relat. 21, 893-912 (2018). 

8. T. L. Stewart, I. M. Latu, N. R. Branscombe, N. L. Phillips,
H. Ted Denney, J. Soc. Issues 68, 11-27 (2012).

9. T. L. Dover, Group Process. Intergroup Relat. 25, 746-767
(2022).

10. B. Doosje, N. R. Branscombe, R. Spears, A. S. Manstead,
J. Pers. Soc. Psychol. 75, 872-886 (1998).

11. R. M. Chow, J. Galak, Psychol. Sci. 23, 1467-1469 (2012).

12. P. Dietze, M. A. Craig, Nat. Hum. Behav. 5, 349-360
(2021).

13. A. H. Maslow, Psychol. Rev. 50, 370-396 (1943).

14. E. Diener, E. Suh, Soc. Indic. Res. 40, 189-216 (1997).

15. J. P. Ruger, Lancet 364, 1075-1080 (2004).

16. Centers for Disease Control and Prevention, “Community
Health and Program Services (CHAPS): Health Disparities
Among Racial/Ethnic Populations” (US Department of Health
and Human Services, 2008).

17. E. C. Gearan, K. Monzella, L. Jennings, M. K. Fox, Nutrients 12,
3891 (2020).

18. E. Badger, “Redlining: Still a thing,” Washington Post, 28 May
2015; https://www.washingtonpost.com/news/wonk/wp/2015/05/28/evidence-that-banks-still-deny-black-borrowers-just-as-they-did-50-years-ago/.

19. A. Johnson, “Lack of health services and transportation impede
access to vaccine in communities of color,” Washington Post,
13 February 2021; https://www.washingtonpost.com/health/2021/02/13/covid-racial-ethnic-disparities/.

20. R. Ray, “Why are Blacks dying at higher rates from COVID-19?” 
(Brookings, 2020); https://www.brookings.edu/articles/why- 
are-blacks-dying-at-higher-rates-from-covid-19/. 

21. J. E. DeVylder, D. M. Anglin, L. Bowleg, L. Fedina, B. G. Link, 
Annu. Rev. Clin. Psychol. 18, 527-552 (2022). 

22. American Psychological Association, “Economic inequality” 
(2022); https://www.apa.org/topics/economic-inequality. 

23. M. Garcia, “Poor black children are much more likely to attend 
high-poverty schools than poor white children” (Economic 
Policy Institute, 2017); https://www.epi.org/publication/poor- 
black-children-are-much-more-likely-to-attend-high-poverty- 
schools-than-poor-white-children/. 

24. E. Patten, “Racial, gender wage gaps persist in U.S. despite 
some progress” (Pew Research Center, 2016); https://www. 
pewresearch.org/short-reads/2016/07/01/racial-gender- 
wage-gaps-persist-in-u-s-despite-some-progress/. 

25. V. Wilson, W. Darity Jr., “Understanding black-white disparities 
in labor market outcomes requires models that account for 
persistent discrimination and unequal bargaining power” 
(Economic Policy Institute, 2022); https://www.epi.org/ 
unequalpower/publications/understanding-black-white- 
disparities-in-labor-market-outcomes/. 

26. R. Manduca, Sociol. Sci. 5, 182-205 (2018). 

27. R. F. Baumeister, M. R. Leary, Psychol. Bull. 117, 497-529 
(1995). 

28. J. Kinsler, Econ. Educ. Rev. 30, 1370-1383 (2011). 

29. J. A. Okonofua, G. M. Walton, J. L. Eberhardt, Perspect. 
Psychol. Sci. 11, 381-398 (2016). 

30. M. Masterson, “Black teens report alienation, fear, in dealings 
with the police” (WTTW News, 2016); https://news.wttw.com/ 
2016/11/25/black-teens-report-alienation-fear-dealings-police. 

31. E. Anderson, Sociol. Race Ethn. 1, 10-21 (2015). 

32. J. Lee, Work Occup. 27, 353-376 (2000). 

33. J. Baron, M. Spranca, Organ. Behav. Hum. Decis. Process. 70, 
1-16 (1997). 

34. G. S. Berns et al., Phil. Trans. R. Soc. B 367, 754-762 (2012). 




35. J. Graham, J. Haidt, in The Social Psychology of Morality: 
Exploring the Causes of Good and Evil, P. Shaver, M. Mikulincer, 
Eds. (APA Books, 2012), pp. 11-31. 

36. P. Rozin, S. Wolf, Judgm. Decis. Mak. 3, 325-334 (2008). 

37. R. L. Ruttan, L. F. Nordgren, J. Pers. Soc. Psychol. 121,
1223-1240 (2021).

38. P. E. Tetlock, O. V. Kristel, S. B. Elson, M. C. Green, J. S. Lerner, 

J. Pers. Soc. Psychol. 78, 853-870 (2000). 

39. P. E. Tetlock, Trends Cogn. Sci. 7, 320-324 (2003). 

40. A. H. Maslow, Motivation and Personality (Harper & Row 

Publishers, ed. 3, 1987). 

41. E. W. Mathes, J. Humanist. Psychol. 21, 69-72 (1981).

42. I. Ritov, J. Baron, Organ. Behav. Hum. Decis. Process. 79,
79-94 (1999). 

43. This is the final sample size after removing participants with 
incomplete responses. Initially, study 1a had 200 participants
and study 1b had 343 participants. Results remain consistent
if we retain participants with incomplete responses. 

44. This is the final sample size after removing participants with 
incomplete responses. Initially, study 3 had 1607 participants.
Results remain consistent if we retain participants with 
incomplete responses. 

45. Centers for Disease Control and Prevention, “Social 
Determinates of Health at the CDC” (2022); https://www.cdc.gov/about/sdoh/index.html#:~:text=They%20are%20the%20conditions%20in,climate%20change%2C%20and%20political%20systems.

46. N. E. Adler, M. M. Glymour, J. Fielding, JAMA 316, 1641-1642 
(2016). 

47. In study 4b, we preregistered one other policy (allocation of 
the federal budget). This item did not yield significant results 
(see the supplementary materials for detailed descriptions 
of all items and analyses). 

48. A. L. Skinner-Dorkenoo et al., Soc. Sci. Med. 301, 114951 (2022). 

49. C. A. Miller, C. L. Wilkins, C. de Paula Couto, J. Farias, 

J. A. Lisnek, Soc. Sci. Med. 320, 115712 (2023). 

50. J. R. Kluegel, E. R. Smith, Beliefs About Inequality: Americans’ 
Views of What Is and What Ought To Be (Routledge, 2017). 

51. R. D. Benford, D. A. Snow, Annu. Rev. Sociol. 26, 611-639 
(2000). 

52. Department of Economic and Social Affairs, “World Social 
Report 2020: Inequality in a Rapidly Changing World” (United 
Nations, 2020); https://www.un.org/development/desa/ 
dspd/wp-content/uploads/sites/22/2020/02/World-Social- 
Report2020-FullReport.pdf. 

53. R. M. Brown, P. Dietze, M. A. Craig, Highlighting health 
consequences of racial disparities sparks support for action, 
dataset, Dryad (2023); https://doi.org/10.5061/ 
dryad.cz8w9gj8t. 




ACKNOWLEDGMENTS 


We thank the TESS program for aid in study 3. Funding: This study 
was supported by National Science Foundation grant NSF-BCS- 
2046842 (M.A.C.). Author contributions: Conceptualization: R.M.B., 
P.D., and M.A.C. Methodology: R.M.B., P.D., and M.A.C. 
Investigation: R.M.B. and P.D. Visualization: R.M.B. Funding 
acquisition: M.A.C. Project administration: R.M.B. and P.D. 
Supervision: P.D. and M.A.C. Writing - original draft: R.M.B. and P.D. 
Writing - review & editing: R.M.B., P.D., and M.A.C. Competing 
interests: The authors declare that they have no competing 
interests. Data and materials availability: All study materials are 
provided in the supplementary materials, and all data and syntax 
are provided at Dryad (53). All studies followed ethical guidelines 
and were IRB approved (study 1, IRB-FY2018-1870; study 2, IRB- 
FY2018-2070; and studies 3 and 4, IRB-1700). License 
information: Copyright © 2023 the authors, some rights reserved; 
exclusive licensee American Association for the Advancement of 
Science. No claim to original US government works. 
https://www.science.org/about/science-licenses-journal-article-reuse 


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.adh4262 


Materials and Methods 
Supplementary Text 

Figs. S1 to S3 

Tables S1 to S5 

Study Materials 

References (54, 55) 

MDAR Reproducibility Checklist 


Submitted 2 March 2023; resubmitted 18 August 2023 
Accepted 15 November 2023 
10.1126/science.adh4262 


science.org SCIENCE 


RESEARCH | RESEARCH ARTICLES 


SOLAR CELLS 


Homogenized NiOₓ nanoparticles for improved hole 
transport in inverted perovskite solar cells 


Shiqi Yu, Zhuang Xiong, Haitao Zhou, Qian Zhang, Zhenhan Wang, Fei Ma, Zihan Qu, 
Yang Zhao, Xinbo Chu, Xingwang Zhang, Jingbi You 


The power conversion efficiency (PCE) of inverted perovskite solar cells (PSCs) still lags behind that 
of conventional PSCs, in part because of inefficient carrier transport and poor morphology of hole 
transport layers (HTLs). We optimized self-assembly of [4-(3,6-dimethyl-9H-carbazol-9-yl)butyl]phosphonic 
acid (Me-4PACz) onto nickel oxide (NiOₓ) nanoparticles as an HTL through treatment with hydrogen 
peroxide, which created a more uniform dispersion of nanoparticles with high conductivity attributed to 
the formation of Ni³⁺ as well as surface hydroxyl groups for bonding. A 25.2% certified PCE for a 
mask size of 0.074 square centimeters was obtained. This device maintained 85.4% of the initial PCE 
after 1000 hours of stabilized power output operation under 1 sun light irradiation at about 50°C 
and 85.1% of the initial PCE after 500 hours of accelerated aging at 85°C. We obtained a PCE of 21.0% 
for a minimodule with an aperture area of 14.65 square centimeters. 


Metal halide perovskite solar cells (PSCs) with inverted (p-i-n) structures have 
achieved certified power conversion efficiencies (PCEs) as high as 24% and 
have better long-term stability under light and heat stresses compared with the 
more efficient n-i-p structures (1-4). Organic materials such as 
poly[bis(4-phenyl)(2,4,6-trimethylphenyl)amine] (PTAA), which are often used as 
hole transport layers (HTLs), suffer from low conductivity, hydrophobicity, and 
energy-level mismatch (5-7). 

Recently, self-assembled molecules (SAMs) such as 
[4-(3,6-dimethyl-9H-carbazol-9-yl)butyl]phosphonic acid (Me-4PACz) and 
[2-(3,6-dimethoxy-9H-carbazol-9-yl)ethyl]phosphonic acid (MeO-2PACz) have been 
successfully used in PSCs as HTLs to achieve impressive PCEs (8, 9). However, 
SAM bonding with transparent conductive oxide substrates such as indium tin oxide 
(ITO) and fluorine-doped tin oxide (FTO) may not be strong enough to create the 
homogeneous distribution on the substrate needed to avoid a broad 
device-performance distribution (8, 10). Several recent studies showed that 
p-type nickel oxide (NiOₓ) could form stronger chemical bonds with the SAM layer 
than ITO or FTO, help to deposit a homogeneous SAM (10-12), and deliver better 
device performance and reproducibility (13, 14). 

A high-quality p-type NiOₓ HTL needs to be compact to ensure better SAM 
deposition quality and reduce interface recombination, and it also needs to be 
highly conductive for better charge transfer. We developed a simple dopant-free 
method to modulate the NiOₓ nanoparticle (NP) solution by adding H₂O₂. 
Incorporation of H₂O₂ improved the dispersion of NiOₓ NPs and avoided particle 
aggregation that would inhibit full coverage of the transparent conductive oxide 
(TCO) substrate. This approach also increased the ratio of Ni³⁺ in NiOₓ and 
formed NiOOH, a stable component that also has high conductivity. The Me-4PACz 
SAM that formed homogeneously on the modulated NiOₓ surface was more favorable 
for the growth of the perovskite and deposition of uniform large-area films by 
reducing the contact angle, which improved charge extraction and lowered leakage 
currents. As a result, devices based on the modulated NiOₓ/SAM bilayer exhibited 
an efficiency of 25.5% (Newport certified stabilized efficiency: 25.2%), and the 
0.1-cm² devices retained 85.4 and 85.1% of the initial efficiency after 
1000 hours of operation under AM1.5G light illumination at 50°C and 500 hours of 
continuous thermal stress at 85°C, respectively. In addition, a large-area device 
of 14.65 cm² had a PCE of 21%. 


HTL layer synthesis and characterization 


We observed that H₂O₂ could interact with commercial NiOₓ NPs (nominal size 
10 nm) and improve their dispersion in aqueous solvent (figs. S1 and S2). 
High-resolution transmission electron microscopy showed that the control NiOₓ NPs 
aggregated and their typical size was >10 nm (Fig. 1A). After H₂O₂ addition, the 
aggregation was suppressed and the single-particle size also decreased to ~5 nm 
(Fig. 1B). Dynamic light scattering confirmed that the average size of the 
dispersed NPs decreased from 12 to 9 nm after H₂O₂ treatment, with a much 
narrower particle size distribution (Fig. 1C). The formation of hydrated NiOₓ 
through reaction with H₂O₂ likely inhibited NiOₓ NP aggregation, enhanced 
dispersion, and decreased the NP size (15). 

The effect of the NP size and dispersion on the growth of NiOₓ films was 
reflected in the morphologies of the NiOₓ films deposited on the FTO substrate. 
The H₂O₂-modulated NiOₓ film was more compact and uniform compared with the 
control sample. Cross-sectional scanning electron microscopy (SEM) images 
(fig. S3) revealed that the modulated NiOₓ film was dense and conformal. 
Cross-sectional transmission electron microscopy and energy dispersive 
spectroscopy mapping results (fig. S4) further confirmed this finding. The 
root-mean-square roughness of the NiOₓ film was reduced from 4.4 to 3.2 and 
to 2.9 nm after adding 20 and 40 mol% H₂O₂, respectively (fig. S5). The 
improvement of the NiOₓ morphology was expected to reduce the leakage current 
and recombination when the film is combined with a SAM and applied as the HTL 
in perovskite devices. 

X-ray photoelectron spectroscopy (XPS) was performed to investigate the chemical 
states of the deposited NiOₓ thin film. For the NiOₓ film, with and without H₂O₂ 
modulation, the Ni 2p spectrum showed peaks at 857.1, 855.8, and 854.1 eV, 
representing the Ni**, Ni**, and Ni** species, respectively (Fig. 1, D and E) (16). 
Usually, Ni³⁺ originates from the Ni vacancies and is mainly distributed in the 
grain boundaries of nonstoichiometric oxygen-rich NiOₓ (17, 18), which is 
beneficial for hole transport. As illustrated in the Ni 2p spectrum, the increase 
in the ratio of Ni³⁺ from 31.3 to 38.3% was evidence for the generation of Ni 
vacancies in the H₂O₂-modulated sample. In the oxygen 1s spectrum (fig. S6), we 
assigned peaks at binding energies of 529.5, 531.0, 532.2, and 533.2 eV to the 
NiO, hydroxide (-OH), Ni**-O, and Ni**-O, respectively (16). The ratio of -OH 
decreased from 29.6 to 22.4% in the H₂O₂-modulated sample. Conversely, the ratio 
of Ni³⁺-O increased from 11.2 to 18.35%, likely caused by the transition from 
Ni(OH)₂ to NiOOH in the H₂O₂-modulated NiOₓ film. The presence of NiOOH not only 
improved the conductivity of NiOₓ films (19) but also provided more surface 
hydroxyl groups for SAM bonding (20). 

Both Fourier transform infrared (FTIR) and Raman spectroscopy further verified 
the conversion from Ni(OH)₂ to NiOOH. The FTIR absorption peaks at 410 and 
500 cm⁻¹ in the control NiOₓ film, which we attributed to the bending vibration 
modes γ(Ni-OH) and δ(Ni-OH), respectively (21, 22), were absent in the 
H₂O₂-modulated NiOₓ sample, indicating that the Ni(OH)₂ inside the treated film 
had reacted (Fig. 1F). In the Raman spectrum (fig. S7), the intensity of the 2LO 
mode (1075 cm⁻¹) was suppressed in the H₂O₂-modulated NiOₓ. The intensity of 2LO 
is inversely proportional to the concentration of Ni³⁺ inside NiOₓ (23), so this 
result is consistent with an increased density of Ni³⁺ in the H₂O₂-modulated 
samples. 

Conductive-mode atomic force microscopy results also indicated that the 
conductivity of NiOₓ increased substantially after modulation (fig. S8). The 
mobility of the NiOₓ film increased from 2.24 × 10⁻⁴ to 2.03 × 10⁻³ cm² V⁻¹ s⁻¹ 
after treatment, based on space-charge-limited current measurements (fig. S9) 
(24). Both the control and modulated NiOₓ were able to retain their original 
electrical properties after aging at 85°C for 100 hours (fig. S10). 

Fig. 1. Characterization of the NiOₓ nanoparticles. (A and B) High-resolution transmission electron microscopy images of the control and modulated NiOₓ 
nanoparticles, respectively. (C) Size distribution of control and modulated NiOₓ nanoparticles in solvent, measured by dynamic light scattering. 
(D and E) XPS spectra of Ni 2p core levels for control and H₂O₂-modulated NiOₓ films, respectively. (F) FTIR spectra of control and modulated NiOₓ nanoparticles in 
the 400 to 700 cm⁻¹ wavenumber region. 

Kelvin probe force microscopy (KPFM) was used to measure the surface potential 
distribution of Me-4PACz on the different substrates (FTO, FTO/control NiOₓ, and 
FTO/modulated NiOₓ; Fig. 2, A to C). The corresponding KPFM line profiles are 
compared in Fig. 2, D to F. Although Me-4PACz could cover most of the FTO 
substrate, some inhomogeneous regions were present. The SAM layer coverage was 
more uniform after inserting a NiOₓ buffer layer, which could be caused by 
bonding between the phosphonic acid group and surface hydroxyl groups of the 
NiOₓ through strong P-O···Ni or P=O···Ni bonds (20). As expected, Me-4PACz was 
almost fully uniformly assembled on the FTO/modulated NiOₓ substrates, 
indicating that the condensed and uniform NiOₓ layer with more surface hydroxyl 
groups provided better sites for the growth of a high-quality SAM layer. 

XPS confirmed improved chemical adsorption of the SAM layer on the NiOₓ 
substrates (Fig. 2, G to I). For the SAMs deposited on the FTO substrate, the 
P 2p₁/₂ and 2p₃/₂ core levels were observed at 133.9 and 133.1 eV, respectively, 
indicative of Me-4PACz binding (25). The presence of an Sn 4s peak at 138.9 eV 
indicated incomplete coverage of the FTO surface. After inserting the NiOₓ 
layer, the integrated intensity of the P 2p peak increased by 2.35 times 
compared with the FTO/SAM sample, indicating a dense and uniform deposition of 
SAM on the NiOₓ substrate. The integrated intensity of P 2p increased 3.0 times 
on modulated NiOₓ substrates compared with that deposited on FTO directly. In 
addition, the Sn 4s peak had almost disappeared, consistent with full coverage 
of modulated NiOₓ on the FTO substrate. 

The energy levels of a very thin film can vary with its thickness and can also 
be affected by the underlying substrate (26). Ultraviolet photoelectron 
spectroscopy was used to measure the highest occupied molecular orbital (HOMO) 
level of Me-4PACz deposited on different surfaces, which was -5.35, -5.62, and 
-5.68 eV for the FTO, control NiOₓ, and modulated NiOₓ surfaces, respectively 
(fig. S11). The HOMO level is more consistent with the intrinsic properties of 
Me-4PACz (27) when it is deposited on a NiOₓ surface, which could be the result 
of better SAM coverage on NiOₓ. We also measured the corresponding energy level 
of the nearly pure FAPbI₃ that we used. The calculated valence band of the 
perovskite, at -5.67 eV (fig. S12), was near the HOMO level of the modulated 
NiOₓ/SAM interface (-5.68 eV). This band alignment could lead to a small energy 
loss at the interface for the solar cell devices (fig. S13 and table S1). 

We also measured contact angles to study the wetting properties of the different 
surfaces (Fig. 3, A to C). The perovskite precursor on Me-4PACz-only substrates 
had a large contact angle of 39.8°, which is consistent with previous reports 
that perovskite deposition on the Me-4PACz surface is relatively challenging 
(20, 28). The contact angle decreased to 27.4° on the control NiOₓ/Me-4PACz 
surface and further decreased to 20.6° on the modulated NiOₓ/Me-4PACz surface, 
indicating that the NiOₓ buffer layer enabled better wetting for perovskite 
growth. The improved hydrophilicity could be caused by the formation of a 
homogeneous SAM (9) or by NiOₓ with more hydroxyl groups (fig. S6). The top 
perovskite layers had similar morphologies (fig. S14). We also examined the 
morphologies of the buried surface of the perovskite layer in contact with the 
charge transport layer by peeling off the perovskite layer from the substrate 
(9). Obvious morphology defects such as pinholes were observed for the 
perovskite film deposited on the Me-4PACz-only surface, which could be caused by 
the dewetting of the Me-4PACz surface (Fig. 3D). After inserting the NiOₓ layer 
between SAM and FTO, the bottom surface of the perovskite became denser and the 
crystals were more homogeneous (Fig. 3, E and F). 

Fig. 2. Characterization of the SAM and NiOₓ/SAM layers. (A to C) KPFM images of surface potential of Me-4PACz, control-NiOₓ/Me-4PACz, and modulated- 
NiOₓ/Me-4PACz, respectively. (D to F) Corresponding line profiles of Me-4PACz, control-NiOₓ/Me-4PACz, and modulated-NiOₓ/Me-4PACz, respectively. (G to I) XPS 
spectra of P 2p core levels for Me-4PACz, control-NiOₓ/Me-4PACz, and modulated-NiOₓ/Me-4PACz films, respectively. 

We studied the charge transfer and the recombination properties at the 
perovskite buried interface using photoluminescence (PL) and time-resolved 
photoluminescence (TRPL) measurements with the stack of FTO/HTL/perovskite; for 
comparison, the results for perovskite directly deposited on FTO were also 
included (Fig. 3, G and H). The PL intensity of perovskite deposited on the SAM 
surface was greater than that on the FTO surface (Fig. 3G), indicating that the 
SAM could passivate the buried surface of the perovskite layer (8). The PL 
intensity gradually increased with the coverage of the SAM, suggesting that the 
more compact SAM surface provides more passivation sites for the perovskite 
layer. In addition to the PL enhancement, a blue shift from 815 to 809 nm was 
observed, further confirming suppression of the interface recombination. 

The perovskite TRPL results could be fit by a biexponential decay, except for 
the film deposited directly on FTO (Fig. 3H). The fast decay was attributed to 
charge extraction (29). The fast-decay lifetimes were 54, 46, and 42 ns for 
Me-4PACz, control NiOₓ/Me-4PACz, and modulated NiOₓ/Me-4PACz, respectively, 
indicating that the hole extraction became more efficient after inserting and 
modulating the NiOₓ. This improvement could have resulted from the better 
coverage of the SAM layer (Fig. 2) and the enhanced electronic properties of the 
NiOₓ (fig. S9). A longer decay time was observed for all perovskites on SAM 
layers; the perovskite on SAM only had a lifetime of 0.7 μs, but the lifetime 
was 2.3 μs for the modulated NiOₓ surface (Fig. 3H), indicating that the fully 
covered Me-4PACz efficiently passivated the buried interface of the perovskite. 
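
For readers who want to picture the two-component TRPL decay described above, the following is a minimal sketch of a biexponential model, I(t) = A_fast exp(-t/τ_fast) + A_slow exp(-t/τ_slow). The two lifetimes are the values quoted for the modulated NiOₓ/Me-4PACz sample (42 ns and 2.3 μs); the equal amplitudes and the function name are assumptions made only for illustration and are not taken from the paper.

```python
import math


def biexponential(t_ns, a_fast, tau_fast_ns, a_slow, tau_slow_ns):
    """Normalized PL intensity at time t_ns for a two-component decay."""
    fast = a_fast * math.exp(-t_ns / tau_fast_ns)   # attributed to charge extraction
    slow = a_slow * math.exp(-t_ns / tau_slow_ns)   # slower recombination component
    return fast + slow


# Lifetimes quoted in the text for modulated NiOx/Me-4PACz.
TAU_FAST_NS = 42.0
TAU_SLOW_NS = 2300.0
# Hypothetical amplitudes (assumption): equal weights summing to 1.
A_FAST, A_SLOW = 0.5, 0.5

for t in (0.0, 50.0, 500.0, 2000.0):
    value = biexponential(t, A_FAST, TAU_FAST_NS, A_SLOW, TAU_SLOW_NS)
    print(f"t = {t:6.0f} ns  ->  I(t) = {value:.3f}")
```

With these assumed amplitudes, the fast term has essentially vanished after a few hundred nanoseconds, so the tail of the curve is governed by the slow lifetime alone.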


Photovoltaic properties 


We fabricated devices with the architecture 
glass/FTO/NiOₓ/Me-4PACz/FAPbI₃/PCBM/SnO₂/Cu, where PCBM is 
[6,6]-phenyl-C₆₁-butyric acid methyl ester. The fabrication details can be found 
in the supplementary materials. FAPbI₃ was used as the perovskite absorber with 
a bandgap of around 1.52 eV, and the thickness of the perovskite film was 
~700 nm with large grains spanning the film. The grain size was ~1 μm, as 
determined from the cross-sectional and top-view SEM images (figs. S14 and S15). 
The NiOₓ layer improved the device performance (Fig. 4A). For the device based 
on Me-4PACz without NiOₓ, the PCE for an illuminated area of 0.1 cm² was 22.2% 
with an open-circuit voltage (Voc) of 1.11 V, short-circuit current density 
(Jsc) of 26 mA/cm², and fill factor (FF) of 77.1%. For devices with the NiOₓ 
layer, the Voc and FF increased to 1.14 V and 81.1%, respectively, and a PCE of 
24.1% was achieved. For PSCs based on H₂O₂-modulated NiOₓ, a PCE of up to 25.5% 
was achieved with a Voc of 1.16 V, a Jsc of 26.15 mA/cm², and an FF of 84.1% 
(Fig. 4A). We also found that the modulated NiOₓ/SAM devices showed negligible 
hysteresis compared with devices using unmodified NiOₓ (Fig. 4B and fig. S16). 
The detailed device parameters are listed in tables S2 and S3. To show that the 
enhancement after NiOₓ insertion is universal, we replaced the substrate and 
self-assembled molecule with ITO and MeO-2PACz, respectively, and a similar 
trend was observed even though the device performance was not optimized (figs. 
S17 and S18 and tables S4 and S5). 
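
As a quick consistency check on the J-V parameters quoted above, the PCE can be recomputed as Voc × Jsc × FF divided by the incident power. The sketch below assumes the standard 1-sun input of 100 mW/cm² (AM1.5G); the function name `pce_percent` is illustrative and does not come from the paper.

```python
def pce_percent(voc_v, jsc_ma_cm2, ff_percent, p_in_mw_cm2=100.0):
    """Power conversion efficiency in percent.

    voc_v        open-circuit voltage in volts
    jsc_ma_cm2   short-circuit current density in mA/cm^2
    ff_percent   fill factor in percent
    p_in_mw_cm2  incident power density; 100 mW/cm^2 is assumed for 1 sun
    """
    p_out_mw_cm2 = voc_v * jsc_ma_cm2 * (ff_percent / 100.0)
    return 100.0 * p_out_mw_cm2 / p_in_mw_cm2


# Values quoted in the text (Fig. 4A).
sam_only = pce_percent(1.11, 26.0, 77.1)     # Me-4PACz without NiOx
modulated = pce_percent(1.16, 26.15, 84.1)   # H2O2-modulated NiOx/Me-4PACz

print(f"SAM only:  {sam_only:.2f}% (reported 22.2%)")
print(f"Modulated: {modulated:.2f}% (reported 25.5%)")
```

Both recomputed values agree with the reported efficiencies to within rounding of the quoted parameters (the SAM-only Jsc is given only as 26 mA/cm²).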

The typical external quantum efficiency result of the device is shown in 
fig. S19, which closely matches the Jsc from current density-voltage (J-V) 
sweeps. One of our best-performing PSCs was sent to an accredited laboratory 
(Newport Corporation) for certification. The device with an illuminated mask 
area of 0.074 cm² delivered a certified quasi-steady-state efficiency of 25.2% 
with a Voc of 1.18 V, Jsc of 26.2 mA/cm², and FF of 81.2% (fig. S20). Compared 
with the in-house measurement results, the certified results showed a 2% 
reduction in FF, which could be caused by contact degradation during 
transportation. Moreover, we collected the PV parameters of 50 devices for each 
condition (control and H₂O₂-modulated) in fig. S21. The high reproducibility was 
evidenced by the narrow PCE distribution, and the average PCE increased from 
23.5 to 25.0% with modulation, which mainly reflected increases in Voc and FF. 

The reverse leakage current density was reduced by more than one order of 
magnitude after the modulation of NiOₓ with H₂O₂, further indicating that 
recombination at the buried surface was suppressed (Fig. 3I). The current 
density of the modulated NiOₓ sample under forward bias was higher than that of 
the SAM-only and control NiOₓ samples, reflecting better charge-transport 
properties. 

We measured the transient photovoltage to compare charge recombination 
(fig. S22). The decay time increased from 1.44 μs for the SAM-only device to 
2.02 μs for modulated NiOₓ/SAM-based devices, indicating that the recombination 
had been suppressed, which was consistent with the TRPL results (Fig. 3H). We 
also measured the Voc-light intensity dependence (fig. S23) and found that the 
ideality factor for the device without NiOₓ was 1.71, an indicator of 
substantial recombination in the SAM-only device. With the NiOₓ layer, the 
ideality factor decreased to 1.50, and it decreased further to 1.22 for 
modulated NiOₓ-based devices, which is near the value of 1.14 for the champion 
normal-structure (n-i-p) devices (30). 
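
The ideality factor is commonly extracted from the slope of Voc against the natural logarithm of light intensity, Voc = n(kT/q) ln(I) + constant. The sketch below illustrates that extraction with an ordinary least-squares fit. It is not the authors' analysis: the intensity and Voc values are synthetic (generated from n = 1.22, not measured data), and the temperature of 298.15 K is an assumption.

```python
import math

K_B_OVER_Q = 8.617333e-5  # Boltzmann constant / elementary charge, in V/K


def ideality_factor(intensities, vocs, temperature_k=298.15):
    """Least-squares slope of Voc vs ln(intensity), divided by kT/q."""
    xs = [math.log(i) for i in intensities]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(vocs) / len(vocs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, vocs))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope_v = sxy / sxx  # volts per unit of ln(intensity)
    return slope_v / (K_B_OVER_Q * temperature_k)


# Synthetic data (assumption): Voc of 1.16 V at 1 sun and n = 1.22.
thermal_voltage = K_B_OVER_Q * 298.15
suns = [0.1, 0.2, 0.5, 1.0]
synthetic_voc = [1.16 + 1.22 * thermal_voltage * math.log(s) for s in suns]

n_fit = ideality_factor(suns, synthetic_voc)
print(f"fitted ideality factor: {n_fit:.2f}")
```

A value near 1 indicates that band-to-band recombination dominates, whereas values approaching 2 point to trap-assisted recombination, which is why the drop from 1.71 to 1.22 is read as suppressed defect recombination.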


We also studied the electroluminescence performance of the devices working as 
light-emitting diodes. The modulated NiOₓ device had a 2.53% external quantum 
efficiency versus 0.32% for the Me-4PACz device (fig. S24). This ~8-fold 
electroluminescence efficiency enhancement was consistent with a 50-mV Voc 
increase (Fig. 4A). We conclude that the increase in Voc was mainly caused by 
suppression of the defect-related recombination. For the FF and hysteresis 
improvement, the better physical contact from improved wetting (Fig. 3, A to C) 
and improved charge transport from better band alignment (fig. S13) or enhanced 
NiOₓ conductivity (fig. S9) could also play a role. 
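
The link between the electroluminescence gain and the Voc gain can be illustrated with the optoelectronic reciprocity relation, in which the nonradiative voltage loss scales as (kT/q) ln(1/EQE_EL). The sketch below assumes that relation and a temperature of 300 K; neither is stated explicitly in the text, so this is a plausibility check rather than the authors' calculation.

```python
import math

K_B_OVER_Q = 8.617333e-5  # Boltzmann constant / elementary charge, in V/K


def voc_gain_mv(eqe_el_high, eqe_el_low, temperature_k=300.0):
    """Expected Voc difference (mV) between two devices from their EL EQEs.

    Uses delta_Voc = (kT/q) * ln(EQE_high / EQE_low); the EQE units cancel,
    so percentages can be passed directly.
    """
    return 1000.0 * K_B_OVER_Q * temperature_k * math.log(eqe_el_high / eqe_el_low)


# EL external quantum efficiencies quoted in the text (fig. S24), in percent.
gain_mv = voc_gain_mv(2.53, 0.32)
print(f"expected Voc gain: {gain_mv:.1f} mV")
```

Under these assumptions the expected gain comes out slightly above 50 mV, in line with the Voc increase from 1.11 to 1.16 V reported for the SAM-only and modulated devices.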


Large-area devices and device stability 


We fabricated minimodules with six subcells and an aperture area of 14.65 cm². 
The structural schematic and photomicrograph of the subcell interconnection 
region are shown in fig. S25, and details of the minimodule fabrication process 
can be found in the experimental section. The device parameters were as follows: 
Voc of 6.9 V for the subcells connected in series, Jsc of 3.87 mA/cm², and FF of 
78.7%, resulting in a PCE of 21% (Fig. 4C and table S6), which is among the 
highest efficiencies for large-area PSCs. On the basis of the geometry design 
and the SEM images of the subcell interconnection region (fig. S25), the 
geometric FF was calculated to be 94.5%, which still leaves some room for 
improvement. Compared with the small device, the performance loss for the 
minimodule comes mainly from the Jsc and FF, and it could be reduced by 
depositing a more uniform perovskite layer. 
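
The minimodule figures above can be cross-checked with simple arithmetic. The sketch below recomputes the aperture-area PCE from the quoted Voc, Jsc, and FF, derives the average Voc per subcell, and estimates an active-area efficiency by dividing by the geometric fill factor. The 100 mW/cm² input power and the active-area conversion (aperture PCE divided by geometric FF) are assumptions for illustration; the active-area number is not reported in the text.

```python
# Minimodule parameters quoted in the text (Fig. 4C).
N_SUBCELLS = 6
VOC_MODULE_V = 6.9        # six subcells in series
JSC_MA_CM2 = 3.87         # referenced to the aperture area
FF_PERCENT = 78.7
GEOMETRIC_FF = 0.945      # active area / aperture area
P_IN_MW_CM2 = 100.0       # assumed 1-sun input power density

# Aperture-area efficiency: Voc * Jsc * FF / P_in.
aperture_pce = 100.0 * VOC_MODULE_V * JSC_MA_CM2 * (FF_PERCENT / 100.0) / P_IN_MW_CM2

# Average open-circuit voltage contributed by each series-connected subcell.
voc_per_subcell = VOC_MODULE_V / N_SUBCELLS

# Rough active-area efficiency, assuming the dead area contributes no current.
active_area_pce = aperture_pce / GEOMETRIC_FF

print(f"aperture-area PCE:  {aperture_pce:.2f}%")
print(f"Voc per subcell:    {voc_per_subcell:.2f} V")
print(f"active-area PCE:    {active_area_pce:.2f}%")
```

The per-subcell Voc is close to the 1.16 V of the small-area champion device, which is consistent with the statement that the module losses come mainly from Jsc and FF.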

To determine the device stabilities, we first compared the stability of 
NiOₓ-based devices without the SAM layer using small-size devices (0.1 cm²). The 
H₂O₂-modulated devices had slightly better stability both in stabilized power 
output tracking at 50°C and in accelerated aging testing at 85°C (fig. S26), 
which could be due to the contact improvement (16). The device with the 
modulated NiOₓ/Me-4PACz bilayer (small-size device) maintained ~79% of its 
initial PCE of 24.1% at maximum power point tracking under 1 sun illumination at 
50°C (figs. S27 and S28). The device used for stability testing did not have an 
antireflection coating, leading to a 2% loss of Jsc compared with the champion 
devices. 

We then fabricated a device in which we deposited a 5-nm layer of ITO between 
the Cu electrode and SnO₂ to avoid diffusion. This device retained 85.4% of its 
initial 23.6% PCE (fig. S29A) after continuous operation for 1000 hours at 50°C 
(Fig. 4D), tested under a continuous bias voltage of 1.02 V, which is the output 
voltage at the maximum power point. The enhanced stability of the device after 
inserting the ITO buffer layer could be caused by suppression of ion migration 
under external stress. 

We also investigated the stability of the minimodule. After 400 hours of 
stabilized power output tracking, the device still showed 90% of its original 
PCE of 20.5% under 1 sun illumination at 50°C (figs. S30 and S31). The 
small-size unencapsulated device with an initial PCE of 23.5% without 
antireflection coating (fig. S29B) was subjected to accelerated aging by heating 
at 85°C in a nitrogen atmosphere. The device retained 85.1% of the initial PCE 
after 500 hours (fig. S32). 


Discussion 


Self-assembled molecules are widely used for designing efficient inverted 
perovskite solar cells, but an inhomogeneous SAM could lead to leakage and 
recombination, both of which limit the device’s performance and reproducibility. 
We incorporated homogeneous NiOₓ nanoparticles for better SAM growth. Based on 
this hole-selecting layer, small-size PSCs with a certified efficiency of 25.2% 
and a 14.65-cm² minimodule with 21% efficiency were obtained; these devices 
exhibited good stability. Further improvement in device performance could be 
achieved by optimizing the buried surface and the top surface. Depositing a more 
uniform perovskite layer will be helpful for further enhancing module 
performance. In addition to solar cells, this bilayer HTL could be used in other 
optoelectronic devices such as light-emitting diodes and photodetectors. 
STRUCTURAL BIOLOGY 


Rab29-dependent asymmetrical activation 
of leucine-rich repeat kinase 2 


Hanwen Zhu, Francesca Tonelli, Martin Turk, Alan Prescott, Dario R. Alessi, Ji Sun 


Gain-of-function mutations in LRRK2, which encodes the leucine-rich repeat kinase 2 (LRRK2), are the most 
common genetic cause of late-onset Parkinson's disease. LRRK2 is recruited to membrane organelles and 
activated by Rab29, a Rab guanosine triphosphatase encoded in the PARK16 locus. We present cryo-electron 
microscopy structures of Rab29-LRRK2 complexes in three oligomeric states, providing key snapshots during 
LRRK2 recruitment and activation. Rab29 induces an unexpected tetrameric assembly of LRRK2, formed by 
two kinase-active central protomers and two kinase-inactive peripheral protomers. The central protomers 
resemble the active-like state trapped by the type I kinase inhibitor DNL201, a compound that underwent a 
phase 1 clinical trial. Our work reveals the structural mechanism of LRRK2 spatial regulation and provides 
insights into LRRK2 inhibitor design for Parkinson's disease treatment. 


Parkinson’s disease (PD) is the second most prevalent neurodegenerative 
disorder, affecting 1 to 2% of the population over the age of 65 (1). Mutations 
in the LRRK2 gene, which encodes the leucine-rich repeat kinase 2 (LRRK2) 
protein, are among the most frequent genetic causes of late-onset PD and account 
for ~5% of familial and ~1% of sporadic cases (2-4). More than 250 mutations in 
LRRK2 have been identified, and ~100 have been biochemically characterized 
(5-7). Most LRRK2-related PD mutations, such as G2019S (Gly²⁰¹⁹→Ser), increase 
kinase activity. Therefore, LRRK2 inhibitors are of great interest to 
researchers in PD treatment (2). 

Much effort has been directed at exploring the structure-function relationship 
in LRRK2 to gain a mechanistic understanding and to guide rational drug 
discovery (8). Previous work revealed high-resolution structures of inactive 
LRRK2 (9, 10), whereas our understanding of LRRK2 activation is limited to 
microtubule-associated LRRK2 structures of disease mutants at intermediate and 
low resolutions (11, 12) or molecular dynamics (MD) simulations in combination 
with hydrogen-deuterium exchange mass spectrometry (HDX-MS) analyses (13-16). 
Furthermore, LRRK2 activation in both physiological and pathogenic conditions is 
tightly associated with membrane recruitment and Rab guanosine triphosphatases 
(GTPases) (17-19), the molecular basis of which remains enigmatic. 

This study aimed to explore the kinase activation mechanism of LRRK2 that is 
induced by Rab29, a membrane-anchored Rab GTPase. Rab GTPases are master 
regulators of intracellular-vesicle trafficking, whose disruption is a hallmark 
of PD pathogenesis (20). Rab29, encoded in the PARK16 locus and associated with 
late-onset PD, is believed to function in the same pathway as LRRK2. Although 
mouse genetic data suggest that alternative activation mechanisms of LRRK2 exist 
(21), physiological connections between Rab29 and LRRK2 are supported by human 
and Caenorhabditis elegans genetic data, similar mouse knockout histological 
phenotypes, cellular colocalization, and physical interactions (22-27). 
Moreover, Rab29, but not its close homologs Rab32 or Rab38, stimulates LRRK2 
kinase activity, as monitored by S1292 autophosphorylation (28), suggesting that 
Rab29 does more than membrane recruitment to stimulate LRRK2 activity. 


Structural determination of 
Rab29-LRRK2 complexes 


To structurally characterize the Rab29-LRRK2 
interaction, we first reconstituted a stable com- 
plex in vitro. Previous studies have suggested 
an emerging “Rab29-LRRK2-Rabs” cascade 
for LRRK2 signaling (Fig. 1A) (9, 29), in which 
GTP-bound Rab29 (Rab29•GTP) facilitates 
LRRK2 membrane recruitment and activation, and 
activated LRRK2 then phosphorylates Rab 
GTPases, including Rab29 itself. Phosphorylated 
Rab29, which was reported not to activate 
LRRK2, could potentially serve as negative feed- 
back (28). We thus introduced three point muta- 
tions (Q67L, T71A, and S72A) to Rab29 (Rab29g,) 
to maximize the opportunity of capturing the 
active Rab29•GTP-LRRK2 complex (where Q 
is glutamine, L is leucine, T is threonine, A is 
alanine, and S is serine). T71A and S72A pre- 
vent Rab29 phosphorylation, and Q67L abolishes 
GTPase activity and enhances the interaction 
between Rab29 and LRRK2 (26). Although 
Q67L also diminishes the membrane localiza- 
tion of Rab29 in cells (26), we reasoned that 
the interaction between Rab29 and LRRK2 is 
independent of membrane environment or 
composition (30) and should not be affected 
in vitro by this mutation. Indeed, glutathione 
S-transferase (GST) pulldown assays validated 
the Rab297),-LRRK2 interaction and complex 
formation (fig. S2G). 

We determined cryo-electron microscopy 
(cryo-EM) structures of the Rab29-LRRK2 com- 
plex, reconstituted by mixing LRRK2, Rab29;,, 
GTP analog (GppNHp), adenosine triphosphate 
(ATP), and Mg²⁺. The final cryo-EM reconstruction 
resulted in Rab29-LRRK2 structures of 
three distinct oligomeric assemblies (Fig. 1, B 
and C; fig. S1; and table S1): Rab29-LRRK2 
monomer with one LRRK2 and one Rab29; 
Rab29-LRRK2 dimer with two LRRK2 and 
two Rab29; and Rab29-LRRK2 tetramer with 
four LRRK2 and two visible Rab29 (Fig. 1, B and 
C). Focused three-dimensional (3D) refinement 
improved the cryo-EM density surrounding the 
Rab29-LRRK2 interface (fig. S1A), and C2 symmetry 
was imposed for the Rab29-LRRK2 dimer 
and tetramer during data analysis (fig. S1A). 


Rab29-dependent recruitment of LRRK2 


In the Rab29-LRRK2 monomer, LRRK2 is al- 
most identical to the inactive LRRK2-alone 
structure (10) (fig. S2A). Rab29 binds to the 
N-terminal armadillo repeat (ARM) domain 
of LRRK2, burying a surface area of ~800 Å² 
(Fig. 2A), and adopts a GTP-bound Switch I 
closed configuration often observed in GTP- 
bound small GTPases (fig. S2, B to E). Docking 
of the GDP-bound Rab29 structure (25) into 
the cryo-EM density shows obvious steric clashes 
with LRRK2 (fig. S2F), explaining why the GTP- 
bound state promotes Rab29-dependent recruit- 
ment of LRRK2 (24, 25, 27, 28, 30, 31). 

The Rab29-LRRK2 interface is formed by 
the ARM9-10 of LRRK2 and the Switch I- 
Interswitch-Switch II surface and CDR1 (32) of 
Rab29 (fig. S2E). The interaction conforms with 
a general Rab-effector recognition mode, in 
which effectors associate with the GTP-bound 
form of Rabs through the Switch I-Interswitch- 
Switch II surface (32). Sequence alignment of 
Rab GTPases, including the Rab32 subfamily 
(Rab29, Rab32, and Rab38) and several LRRK2 
substrates, revealed that key residues in the 
Rab29-LRRK2 interface were conserved among 
the Rab32 subfamily (fig. S2H). Single mutations 
at the center of the interface (Rab29 D43A or 
W62A) were sufficient to abolish the interaction, 
whereas Rab29 L7Q or L76M substitutions at 
the edge of the interface had moderate or little 
impact (fig. S2G). The surfaces of Rab29 and 
Rab10 predicted to interact with LRRK2 had 
almost identical interface residues, which is 
consistent with previous work suggesting that 
Rab10 interacts with LRRK2 at the Rab29 
site (33). Rab5A, 5B, and 5C, which can be 
phosphorylated by LRRK2 (34), contain an alanine in 
the position corresponding to Asp43 in Rab29 
(fig. S2H). 

Consistent with the pulldown-assay results, 
confocal microscopy revealed that Rab29 D43A 
or W62A mutations diminished the membrane 
recruitment of LRRK2, whereas Rab29 L7Q or 
L76M substitutions had moderate or little dis- 
ruptive effects (Fig. 2B and fig. S3A). Similarly, 
LRRK2 mutations (R399E and L403E) at the 
center of the Rab29-LRRK2 interface signifi- 
cantly reduced the Rab29-dependent mem- 
brane recruitment, whereas a mutation at 
the periphery of the interface (M402A) had a 
moderate impact (Fig. 2C and fig. S3B). We 
also assessed the effect of interface mutations 
on LRRK2 kinase activity in cells by monitoring 
the phosphorylation status of Rab10-Thr73, 
Rab29-Thr71, and LRRK2-Ser1292 (28). In agreement 
with the pulldown and membrane-localization 
results, mutations at the center of 
the interface on either the Rab29 (Fig. 2D and 
fig. S3, C and E) or LRRK2 side (Fig. 2E and 
fig. S3D) abolished the Rab29-stimulated ki- 
nase activity, whereas mutations at the pe- 
riphery had only a minor impact. These data 
confirmed the observed Rab29-LRRK2 inter- 
face and suggested the importance of mem- 
brane recruitment for LRRK2 activation. All 
mutations introduced to the Rab29-LRRK2 
interface had minor impacts on the basal activity 
of LRRK2 on Rab10 and Ser1292, supporting 
the view that the observed interface is 
important for LRRK2 recruitment but prob- 
ably not for substrate recognition. 

In the Rab29-LRRK2 dimer assembly, each 
LRRK2 protomer binds a single Rab29 mol- 
ecule at the ARM9-10 interface (Fig. 1B). In this 
X-shaped complex, LRRK2 protomers adopt 
the same inactive conformation observed in 
the Rab29-LRRK2 monomer and the LRRK2- 
alone structure (fig. S4) (10). The two LRRK2 
protomers interact via their COR-B domains, 
in a way that is similar to what we described 
for the LRRK2 homodimer (fig. S4B) (10). 
The observation of LRRK2 dimers both in the 
presence and absence of Rab29 suggests that 
COR-B-mediated dimerization of LRRK2 could 
occur under physiological settings. 

Fig. 1. Structural determination of the Rab29-LRRK2 complex. (A) Schematic diagram showing Rab29- 
mediated LRRK2 membrane recruitment and activation. (B) Cryo-EM structures of the Rab29-LRRK2 
complex in three oligomerization states. (C) Top view of the cryo-EM structure of the Rab29-LRRK2 tetramer. 


A tetrameric assembly of LRRK2 


We captured the Rab29-LRRK2 complex in an 
unexpected tetrameric assembly and determined 
its structure to an overall resolution 
of 3.5 Å (fig. S1 and table S1). With dimensions of 
~205 Å by 260 Å by 150 Å (Fig. 1, B and 
C), the Rab29-LRRK2 tetramer is an assembly 
with twofold rather than fourfold symmetry, 
featuring two types of LRRK2 protomers: 
LRRK2^peri (peripheral) and LRRK2^cen (central) 
(Fig. 1, B and C, and Fig. 3A). We were 
able to resolve and model near full-length 
LRRK2^peri and the associated Rab29^peri. By contrast, 
LRRK2^cen protomers have flexible LRR, 
ankyrin repeat (ANK), and ARM domains, 
and neither those domains nor the associated 
Rab29^cen molecules could be resolved (Fig. 3A 
and fig. S5, A to C). The catalytic halves of 
LRRK2^cen, however, including the ROC, COR, KIN, 
and WD40 domains (ROC, Ras of complex 
proteins; COR, C-terminal of ROC; KIN, kinase), 
were rigid and could be refined to an 
overall resolution of 3.2 Å with focused refinement 
(fig. S1A). 

The LRRK2^peri-LRRK2^cen interaction within 
each asymmetric unit is mediated by the 
COR-B domains (fig. S5, C and D), and the 
COR-B-COR-B interface is similar to that seen 
in LRRK2 homodimers or Rab29-LRRK2 dimers, 
with a subtle rotational motion (fig. S5D). 
In addition, LRRK2^peri directly interacts with 
LRRK2^cen from the other asymmetric unit 
(Fig. 3, A to C), with regions near the LRRK2^peri 
ARM-ANK boundary packing against the 
LRRK2^cen WD40 domain and the LRRK2^peri 
ARM domain associated with the LRRK2^cen 
ROC domain. The two LRRK2^cen protomers 
pack in a “head-to-tail” mode through WD40-KIN 
interfaces (Fig. 3, A and D). The flexible 
N-terminal part of LRRK2^cen might also interact 
with the Rab29^peri and the ARM domain 
of LRRK2^peri, but the low local resolution of 
the cryo-EM map prevents further interpretation 
(fig. S5A). 

The catalytic halves of LRRK2^peri and LRRK2^cen 
show substantial conformational differences 
(Fig. 3E and movie S1). Upon aligning the 
COR-B domain, KIN and WD40 are displaced 
about 40° toward the COR-A and ROC domains 
in LRRK2^cen (fig. S5E). This conformational 
rearrangement closes a central cavity 
shaped by the ROC, COR, and KIN domains 
(Fig. 3E). In this conformation, COR-B, ROC-COR-A, 
KIN N-lobe, and KIN C-lobe-WD40 
appear to move as rigid bodies (fig. S5F). 
Repositioning of KIN C-lobe-WD40 in LRRK2^cen 
disrupts the connection between the WD40 
and ARM-ANK-LRR domains in the inactive 
state, which was stabilized by the scaffolding 
hinge helix and C-terminal helix (fig. S5, G and 
H) (10). Additionally, the KIN C-lobe would 
clash with the LRR domain (fig. S5I), contributing 
to the displacement and flexibility of 
the ARM-ANK-LRR domains in LRRK2^cen protomers 
(fig. S5A). 


LRRK2^cen has an active kinase domain 


The KIN domain of LRRK2^cen has structural 
features of an active kinase. The LRR domain 
that shields the KIN domain in the inactive 
conformation (10) is flexible in LRRK2^cen, 
leaving the KIN domain accessible to substrates 
from the membrane side (fig. S5, A and 
B). Critically, the LRRK2^cen KIN domain adopts 
a closed conformation (Fig. 4A), with the αC 
helix positioned toward the active site and 
the “DYG motif” flipped in. Lys1906 and Glu1920 
form a salt bridge, an interaction blocked by 
Tyr2018 in the inactive conformation (Fig. 4, B 
and C). There is a well-defined cryo-EM density 
for ATP in the active site, and the distance 
between Asp2017 and ATP shortens to 3.8 Å (from 
13.8 Å in the inactive state) (Fig. 4, A and C), 
permitting ATP hydrolysis in the presence of 
substrates. The regulatory spine (R-spine), 
formed by Leu1935, Leu1924, Tyr2018, and Tyr1992, 
becomes continuous (Fig. 4D). Docking of 
the LRRK2^cen model into the 14-Å in situ 
cryo-electron tomography (cryo-ET) map of 
microtubule-bound LRRK2 (Fig. 4E), which 
was proposed to represent an active conformation 
(11), reveals close correspondence, supporting 
our conclusion that the LRRK2^cen KIN 
domain is in an active conformation. 

Fig. 2. Molecular basis of Rab29-dependent LRRK2 recruitment. (A) Rab29-LRRK2 
interface in the LRRK2 monomer state. LRRK2 and Rab29 are colored 
gray and hot pink, respectively. Side chains of interface residues are shown as sticks. 
(B and C) Impact of Rab29 or LRRK2 mutations on LRRK2 localization in HEK293 
cells. Quantification of the portion of LRRK2 overlapping with Rab29 according to 
Manders' coefficient for confocal analysis is shown in fig. S3, A and B. Each empty 
circle represents the colocalization coefficient (Manders' coefficient) measured in one 
cell. Error bars represent SEM. Significance was determined by the Kruskal-Wallis 
one-way analysis of variance (ANOVA) test. ****P < 0.0001; ns, not significant. 
(D and E) Quantification of the immunoblotting data shown in fig. S3, C and D. 
Data are presented as ratios of pRab10-Thr73/total Rab10, pLRRK2-Ser1292/total 
LRRK2, and pRab29-Thr71/total Rab29, normalized to the average of LRRK2 
WT values. The data shown are the mean ± SD of three determinations. 
Single-letter abbreviations for the amino acid residues are as follows: A, Ala; 
C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; 
P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 
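
The colocalization measure named in the Fig. 2 legend (Manders' coefficient) can be illustrated with a minimal sketch. This is not the authors' image-analysis pipeline: the function name, the zero threshold, and the toy pixel arrays are assumptions made only for illustration. It computes the fraction of the total signal in one channel (here standing in for LRRK2) that falls in pixels where the second channel (standing in for Rab29) is above a threshold.

```python
import numpy as np


def manders_m1(channel_a, channel_b, threshold_b=0.0):
    """Fraction of channel A intensity located where channel B exceeds a threshold.

    Hypothetical helper for illustration; real analyses usually apply
    background subtraction and automated thresholding first.
    """
    a = np.asarray(channel_a, dtype=float)
    b = np.asarray(channel_b, dtype=float)
    if a.shape != b.shape:
        raise ValueError("both channels must have the same shape")
    total = a.sum()
    if total == 0:
        return 0.0  # no signal in channel A, so nothing can colocalize
    return float(a[b > threshold_b].sum() / total)


# Toy 3x3 "images" (assumed values, not experimental data).
lrrk2 = np.array([[0, 4, 4],
                  [0, 2, 0],
                  [0, 0, 0]])
rab29 = np.array([[0, 5, 0],
                  [0, 5, 0],
                  [0, 0, 0]])

m1 = manders_m1(lrrk2, rab29)  # (4 + 2) / (4 + 4 + 2) = 0.6
```

A value near 1 means that most of the first channel's signal overlaps the second channel; a value near 0 means little overlap.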

We next examined the interdomain inter- 
actions that stabilize the active conformation 
of the LRRK2^cen KIN domain. The activation 
loop of the KIN domain (35) dips into the 
open pocket between COR-A and COR-B (Fig. 
4F). We hypothesized that this interdomain 
interaction stabilizes the closed conformation 
of the KIN domain and would thus be crucial for 
LRRK2 kinase activity. Indeed, single point mu- 
tations at the KIN-COR interface (P1588A, N1710A, 
and W1791A) reduced the LRRK2 kinase activity 
induced by Rab29 (Fig. 4G and fig. S6). LRRK2 
W1791A almost completely abolished LRRK2 ac- 
tivity in the absence of Rab29, indicating that the 
observed interactions (Fig. 4F) are also critical 
for the basal activity of LRRK2 (Fig. 4G and fig. 
S6C). Therefore, blocking the COR-KIN inter- 
action could be a potential strategy to inhibit 
LRRK2 allosterically. 

Compared with the inactive state, the KIN 
N-lobe rotates slightly toward the COR-B domain 
in LRRK2^cen, leading to more-extensive 
interactions between the αC helix of the KIN 
domain and the docking (Dk) helix of the 
COR-B domain (Fig. 4F). These observations 
are consistent with previous HDX-MS and MD 
simulation studies that indicated an altered 
interface between the COR-B Dk helix and the 
KIN αC helix and the stabilization of a nearby 
COR-B loop (residues 1721 to 1725) upon binding 
of type I inhibitors (fig. S5J) (15). 

The ROC domain is displaced relative to the 
COR-B domain upon LRRK2 activation (fig. 
S5K). COR-B structurally bridges the catalytic 
ROC and KIN domains, and GTP binding in 
the former modulates the kinase activity of the 
latter (36-39). The movement of the ROC domain 
relative to the COR-B domain upon activation 
involves a “seesaw-like” motion of the 
ROC αC helix, with Tyr1699 as the pivot point (fig. 
S5K and movie S2). Our structural observations 
indicate that conformational coupling between 
the ROC and COR-B domains is vital for LRRK2 
activity by contributing to the crosstalk be- 
tween GTPase and kinase activities (10). 

We then determined the cryo-EM structure 
of LRRK2^RCKW (RCKW: ROC-COR-KIN-WD40) 
in complex with DNL201/GNE-0877, a compound 
reported to be safe and well tolerated 
in a phase 1 clinical trial (40) (Fig. 5A and fig. 
S7, A to H). LRRK2^RCKW was used to simplify 
structure determination, which is otherwise 
complicated by the flexibility of the N-terminal 
domains. DNL201 is a 
type I kinase inhibitor that fixes the LRRK2 
kinase domain in an active-like conformation, 
as judged by the compound’s ability to induce 
dephosphorylation of Ser935 (40, 41). The well-resolved 
kinase domain structure revealed a 
binding site for DNL201 within LRRK2 at the 
ATP-binding pocket (Fig. 5B and fig. S7, F to 
H). In the LRRK2^cen and LRRK2^RCKW-DNL201 
structures, the KIN domains adopted highly similar 
structures [root mean square deviation 
(RMSD), 0.7 Å] (Fig. 5C), further supporting the 
active conformation of the LRRK2^cen KIN domain. 
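
To make the RMSD comparison above concrete, the following is a minimal sketch of how an RMSD after optimal rigid-body superposition can be computed for two equally sized sets of matched atomic coordinates. It uses the standard Kabsch algorithm and is not taken from the authors' workflow; the function name and the toy coordinates are assumptions for illustration only.

```python
import numpy as np


def superposed_rmsd(coords_a, coords_b):
    """RMSD between two matched N x 3 coordinate sets after Kabsch superposition."""
    a = np.asarray(coords_a, dtype=float)
    b = np.asarray(coords_b, dtype=float)
    if a.shape != b.shape or a.ndim != 2 or a.shape[1] != 3:
        raise ValueError("inputs must both have shape (N, 3)")
    # Remove translation by centering both sets on their centroids.
    a0 = a - a.mean(axis=0)
    b0 = b - b.mean(axis=0)
    # The covariance matrix and its SVD give the optimal rotation.
    h = a0.T @ b0
    u, _, vt = np.linalg.svd(h)
    # Guard against an improper rotation (reflection).
    d = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    diff = a0 @ rot.T - b0
    return float(np.sqrt((diff ** 2).sum() / len(a)))


# Toy check (assumed coordinates): a rotated and shifted copy superposes exactly.
points = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0],
                   [0.0, 0.0, 1.0], [1.0, 1.0, 1.0]])
theta = np.deg2rad(30.0)
rot_z = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                  [np.sin(theta), np.cos(theta), 0.0],
                  [0.0, 0.0, 1.0]])
moved = points @ rot_z.T + np.array([5.0, -2.0, 3.0])
rmsd_same_shape = superposed_rmsd(points, moved)
```

Because the superposition removes rotation and translation, a small value such as the 0.7 Å reported above reflects residual differences in shape rather than differences in position or orientation.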
Comparing the active conformations of 
LRRK2^cen and LRRK2^RCKW-DNL201, we observed 
several common features. The αC helix, 
activation loop, APE-αF, and αH-αI linkers from 
the KIN domain are the major contributors to 
interactions with COR, and the interface between 
the KIN and COR domains is almost identical 
(Fig. 5D) despite a small displacement of the COR 
domains (Fig. 5C). The seesaw motion between 
the COR-B and ROC domains is also observed, 
as seen by the Tyr1699 side-chain flipping (fig. S8A). 
However, these activation features are different 
from or were not observed in the previous 
microtubule-based LRRK2 model (PDB 
6XR4) (fig. S8B) (11). Additionally, there are 
global differences between Rab29-dependent 
and microtubule-based activation of LRRK2 
because the Rab29-LRRK2 tetramer contains 
asymmetric dimers and microtubule-based 
LRRK2 oligomers are symmetric (fig. S8, 
C and D). 

Fig. 3. Structure of the Rab29-LRRK2 tetramer. (A) Cryo-EM structure of the Rab29-LRRK2 tetramer 
with two different views. Peripheral Rab29 (Rab29^peri) and LRRK2 (LRRK2^peri) are colored in hot pink and 
gray, respectively; central LRRK2 (LRRK2^cen) copies are colored in blue and orange. (B and C) Interactions 
between (B) the WD40 domain of LRRK2^cen and ARM-ANK domains of LRRK2^peri and (C) the ROC domain of 
LRRK2^cen and the ARM domain of LRRK2^peri. (D) Interactions between two LRRK2^cen copies. (E) Conformational 
changes in the C-terminal halves of LRRK2 upon activation. A dashed circle indicates the central cavity between 
the KIN and COR domains. Color codes for different parts of LRRK2 are as follows: ROC, green; COR-A, light 
orange; COR-B, bright orange; N-lobe of KIN, cyan; C-lobe of KIN, marine; WD40, pink. 


The conformational changes revealed by comparing 
active LRRK2^cen or active-like LRRK2-DNL201 
with inactive LRRK2 align very well 
with previous HDX-MS data (13-16). Overall, the 
active conformation has a more compact arrangement; 
the αC helix, activation loop, and APE-αF loop 
of the KIN domain (Fig. 4F and Fig. 5D), and 
the C-terminal part of the COR-B helix (residues 1788 to 
1797) showed lower deuterium exchange (14, 15), 
owing to the closure of the central cavity upon 
activation (Fig. 3E). The only exception was the 
C-terminal half of the αC helix of the ROC domain 
(residues 1426 to 1449), which became more accessible 
and showed increased deuterium exchange 
because of the seesaw motion (fig. S5K). 


Fig. 4. An active conformation of LRRK2. (A) Superposition of kinase domains 
of LRRK2^cen and LRRK2^peri. N- and C-lobes of the LRRK2^cen kinase domain 
are colored in cyan and marine, respectively; the LRRK2^peri KIN domain is 
colored in gray. (Inset) Cryo-EM density of the ATP molecule. 
(B and C) Key catalytic residues in the LRRK2^cen (B) and LRRK2^peri (C) KIN domains, 
with side chains shown as ball-and-stick models. The distances between the 
side chain of D2017 and the phosphate group of ATP are indicated with dashed 
lines. (D) R-spines of the LRRK2^peri and LRRK2^cen KIN domains. 
The four residues forming the R-spine (L1935, L1924, Y2018, and Y1992) are 
shown as green surfaces. (E) Docking of the C-terminal catalytic halves of LRRK2^cen 
into the cryo-ET map of microtubule-bound LRRK2. (F) Movement of the KIN 
domain relative to the COR domain upon activation. (Inset) Interactions between 
the KIN and COR domains in the active conformation; side chains of the interface 
residues are shown as sticks. Dk, docking helix; APE, conserved APE motif; 
AL, activation loop. (G) Quantitative immunoblotting analysis of the cellular 
kinase activity of LRRK2 bearing mutations in the interface between the KIN and 
COR domains (fig. S6C). Data are presented as ratios of pRab10-Thr73/total 
Rab10, pLRRK2-Ser1292/total LRRK2, and pRab29-Thr71/total Rab29, normalized 
to the average of LRRK2 WT values. The data shown are the mean ± SD of 
three experiments. 


Rab29-LRRK2 tetramer and Rab29-dependent activation 


As the Rab29-LRRK2 tetramer has two protomers 
in active kinase conformation, we hypothesized 
that the tetrameric assembly could explain 
the increased Rab29-induced LRRK2 Ser1292 
autophosphorylation (28). To test the hypothesis, 
we first verified that the tetramer state 
was not caused by the Rab29 Q67L mutation, 
which disrupts Rab29 membrane localization 
and impacts the Rab29-dependent LRRK2 
activation in cells (23, 27). Thus, we character- 
ized the Rab29 T71A/S72A-LRRK2 complex be- 
cause Rab29 T71A/S72A has minimal impacts 
on LRRK2 cellular localization or kinase activ- 
ity (fig. S9, A to D) (28). Our cryo-EM analysis 
showed that Rab29 T71A/S72A-LRRK2 forms 
tetramers during 2D classification performed 
with cross-correlation (fig. S9E), although there 
was a lower ratio of tetramer particles than with 
Rab29 Q67L-LRRK2, likely because of a lower 
percentage of GTP-bound Rab29. 

We then compared Rab29-LRRK2 and Rab32-LRRK2 
complexes to dissect the role of LRRK2 
tetramerization in kinase activation. Rab32, a 
close homolog of Rab29 (~56% sequence identity 
and ~70% similarity), can mediate LRRK2 
membrane recruitment (37) but does not support 
LRRK2 activation in human embryonic 
kidney 293 (HEK293) cells, as indicated by low 
Ser1292 autophosphorylation levels (28) and by 
a higher level of Ser935 phosphorylation (Fig. 6, 
A and B), which is associated with inactive 
LRRK2 (41). We determined the cryo-EM structure 
of Rab32-LRRK2 complexes (fig. S10, A 
to C). Rab32 interacts with LRRK2 through an 
interface that is almost identical to that used 
by Rab29 (fig. S10, D and E), but the Rab32-LRRK2 
complex was captured in only two oligomerization 
states: Rab32-LRRK2 monomer and 
Rab32-LRRK2 dimer (Fig. 6C). Furthermore, 
reprocessing of our previous LRRK2-alone dataset 
(10) showed no LRRK2 tetrameric assembly 
(Fig. 6D and fig. S10F). Therefore, we conclude 
that the Rab29-LRRK2 tetramer is associated 
with the Rab29-dependent activation of Ser1292 
autophosphorylation. 

In contrast to our results for Ser1292 
autophosphorylation, we found that phosphorylation 
of Rab10 by LRRK2 is stimulated by both 
Rab29 and Rab32 (Fig. 6B), suggesting that 
membrane recruitment of LRRK2 is sufficient 
to activate Rab10 phosphorylation without a 
requirement for tetramerization. Rab10 
phosphorylation and Ser1292 autophosphorylation 
are thus independent molecular events during 
LRRK2 signaling (42, 43). Endogenous Rab38 
was also reported to increase Rab10 phosphorylation 
but not Ser1292 autophosphorylation in 
melanocytes (44). Therefore, we predict that 
Rab38 should mediate LRRK2 membrane recruitment 
but not the tetramerization of LRRK2. 


Discussion 


In this study, we have presented Rab29-LRRK2 
structures in both active and inactive states, 
which allowed us to analyze PD mutations in the 
context of kinase activation. Gain-of-function 
mutations at six sites (G2019S, I2020T, Y1699C, 
N1437H, R1441C/G/H, and S1761R; fig. S11A) 
have been proposed to be high risk and PD-causing 
(45). Our previous structure of LRRK2 
bearing the G2019S substitution showed little 
difference from the wild-type (WT) LRRK2 in 
the inactive state (10). However, this mutation 
could induce additional interactions with 
Glu1920 and stabilize the critical Lys1906-Glu1920 
salt bridge in the active conformation (fig. S11C). 
Ile2020 moves from a hydrophobic to a hydrophilic 
environment upon LRRK2 activation (fig. 
S11B), and the I2020T mutation would destabilize 
the inactive conformation and favor the 
active conformation. Tyr1699, Asn1437, and Arg1441 
are at the interface between the αC helix of ROC 
and the COR-B domain, where a seesaw-like 
motion of the ROC αC helix occurs upon LRRK2 
activation (fig. S11D, fig. S5K, and movie S2). 
Asn1437 and Arg1441 are located at one side of the 
seesaw and anchor the C-terminal part of the 
αC helix of ROC to the surface of the COR-B 
domain in the inactive state (fig. S11D). Therefore, 
mutations of Asn1437 and Arg1441 should 
weaken the anchoring effect, shifting the balance 
of the seesaw toward the N-terminal of 
the ROC αC helix, hence promoting LRRK2 
activation. Tyr1699 functions as the pivot point 
(fig. S11D and movie S2), and its substitution 
with a smaller residue would lower the energy 
barrier for the seesaw motion and for the transition 
from the inactive to the active state. The 
above structural observations lead us to conclude 
that increased conformational dynamics 
of G2019S, I2020T, Y1699C, N1437H, and 
R1441C/G/H mutations play an important role 
in PD pathogenesis. 

The different modes of activation of LRRK2 
with Rab29 or its close homologs, Rab32 or 
Rab38, are intriguing. Although Rab29, Rab32, 
and Rab38 could all activate Rab10 phosphorylation 
by LRRK2, only Rab29 promotes the 
formation of the LRRK2 tetrameric assembly, boosts 
Ser1292 autophosphorylation, and is associated 
with late-onset PD. Moreover, LRRK2 Ser1292 
autophosphorylation is elevated in urinary exosomes 
of LRRK2 mutation carriers (46-48), but 
the association between Rab10 phosphorylation 
and LRRK2 mutations varies across different 
studies (46-50). We thus speculate a pathogenic 
association between Rab29-dependent LRRK2 
tetramerization and Ser1292 autophosphorylation 
in patients with LRRK2 mutations. It would 
also be interesting to examine Ser1292 
autophosphorylation in PD patients with PARK16 
variations. 

Fig. 5. Structure of LRRK2^RCKW with DNL201. (A) Cryo-EM map of LRRK2^RCKW in complex with type I inhibitor 
DNL201. (Insets) Cryo-EM densities of the DNL201 inhibitor (top) and the surrounding residues (bottom) are shown. 
(B) DNL201 binding site [magnified from bottom inset of (A)]. Side chains of DNL201-interacting residues are shown as 
sticks. (C) Structural comparison of DNL201-bound LRRK2^RCKW (gray) and LRRK2^cen (blue) structures. The kinase domain 
alignment is shown on the right. (D) Comparison of KIN-ROC interface between LRRK2^RCKW-DNL201 (gray) and 
LRRK2^cen (blue and orange). Key structural elements from KIN domain involved in the interaction are labeled. 

Fig. 6. Rab29-LRRK2 tetramer and kinase activation. (A and B) Quantitative 
immunoblotting analysis of the cellular kinase activity of LRRK2 in the presence of 
Rab29 or Rab32. HEK293 cells were transiently cotransfected with WT LRRK2 and 
hemagglutinin (HA)-tagged empty vector (“-”), HA-tagged Rab29, or HA-tagged 
Rab32 (WT or Q85L mutant). Data are presented as rat 

LRRK2 activation features a striking dimer- 
of-asymmetric dimer assembly containing two 
active core subunits encased by two inactive 
peripheral protomers. Intermolecular interac- 
tions between two asymmetric LRRK2 dimers 
stabilize the active Rab29-LRRK2 tetramer 
(Fig. 3, A to D). The formation of active tetra- 
mers is Rab29-dependent because such an assem- 
bly was not observed with Rab32 or without 
Rab29 under similar experimental conditions 
(Fig. 6, C and D) (10). Furthermore, Rab29 binding 
to the extended ARM-ANK-LRR portion 
of LRRK2^cen appears to unlock or facilitate 
LRRK2^cen activation (fig. S5A). However, the 
low resolution of the LRRK2^cen ARM-ANK-LRR 
domains prevents us from dissecting these putative 
mechanisms in atomic detail. Nevertheless, 
this Rab29- and oligomerization-controlled 
asymmetric activation of LRRK2 adds a new 
mode of asymmetric kinase activation to those 
currently represented by EGFR (51), IRAK4 (52), 
and B-Raf (53, 54). 

Lastly, LRRK2 activation is clearly a com- 
plex process and could be achieved by other 
mechanisms, such as lipid oxidation, Rab12, and 
microtubule-based filamentation (11, 12, 55-58). 
This study focuses on Rab29-induced LRRK2 
activation and provides a framework for in- 
terpreting disease mutations in the context of 
kinase activation. Our data suggest that allo- 
steric inhibition of LRRK2 could be potentially 
achieved by disrupting Rab29-LRRK2 inter- 
action, blocking LRRK2 oligomerization, or 
preventing the conformational transition from 
the inactive to the active states. Thus, our findings 
provide novel insights into LRRK2-based 
drug development and PD treatment. 


Polycyclic aromatic hydrocarbons in samples 
of Ryugu formed in the interstellar medium 

Polycyclic aromatic hydrocarbons (PAHs) contain ≲20% of the carbon in the interstellar medium. 
They are potentially produced in circumstellar environments (at temperatures ≳1000 kelvin), by 
reactions within cold (~10 kelvin) interstellar clouds, or by processing of carbon-rich dust grains. 
We report isotopic properties of PAHs extracted from samples of the asteroid Ryugu and the meteorite 
Murchison. The doubly-13C substituted compositions (Δ2×13C values) of the PAHs naphthalene, 
fluoranthene, and pyrene are 9 to 51‰ higher than values expected for a stochastic distribution of 
isotopes. The Δ2×13C values are higher than expected if the PAHs formed in a circumstellar 
environment, but consistent with formation in the interstellar medium. By contrast, the PAHs 
phenanthrene and anthracene in Ryugu samples have Δ2×13C values consistent with formation 
by higher-temperature reactions. 


Polycyclic aromatic hydrocarbons (PAHs), 
organic molecules consisting of multiple 
aromatic rings, are ubiquitous in the 
interstellar medium (ISM). Based on observations 
of mid-infrared emission bands in 
the ISM, PAHs are present in abundances ~10^-7 
times that of hydrogen (1). PAHs are estimated 
to contain <20% of the carbon atoms in the 
ISM of the Milky Way (1, 2) and other galaxies 
(3). PAHs have been proposed as building blocks 
of carbon-rich dust grains, which are abundant 
in the ISM (4), and of higher-molecular-weight 
insoluble organic material (IOM) that comprises 
most of the carbon within meteorites 
(5). However, it is unknown which chemical 
processes produce these forms of reduced carbon 
or where they occur (Fig. 1) (6). 

Small aromatic organics, such as PAHs con- 
taining only a few rings, can form through 
reactions of free radicals in the gas phase, 
particularly the hydrogen-abstraction-carbon- 
addition (HACA) reaction mechanism, which 
is expected to occur in hot (>1000 K) circumstellar 
environments around carbon-rich asymptotic 
giant branch (AGB) stars and on 
Earth by combustion (Fig. 1A) (6). Carbon- 
rich dust grains and IOM could potentially be 
formed through similar processes. However, 
the reaction rates of these high-temperature 
mechanisms are too slow to account for the 
amount of PAHs present within the ISM and 
there is no complete model of the synthesis of 
PAHs within the outflows of AGB stars (6, 7). 

PAHs could also be formed through the break- 
down of carbon-rich dust grains by shock waves, 
cosmic rays, or ultraviolet photolysis (8). How- 
ever, these same processes destroy PAHs (Fig. 
1B). This destruction occurs on time scales (9, 10) 
that are shorter than those expected for pro- 
duction of PAHs in circumstellar envelopes of 
AGB stars (11, 12). 

A third location where PAHs could form is 
in cold (~10 K) molecular clouds within the 
ISM, through either ion-molecule reactions 
(13), or rapid barrierless reactions involving 
radicals (Fig. 1C) (7). Laboratory experiments 
have characterized these chemical mechanisms 
but it is difficult to directly observe specific PAH 
molecules within interstellar molecular clouds 
using spectroscopic methods. The only species 
that have been identified within molecular 
clouds are nitriles derived from PAHs: benzonitrile 
(14) and cyanonaphthalenes (15). Therefore, 
it is unlikely that circumstellar synthesis 
dominates the formation of extraterrestrial PAHs, 
but there is little evidence for interstellar for- 
mation either. 

Later secondary processing reactions within 
a parent body—the asteroid or other Solar 
System object that meteorites originate from— 
could also synthesize PAHs or alter their com- 
position. These reactions are often related to 
aqueous alteration, the modification of solid 
material by reactions with liquid water, which 
is known to have occurred on parent bodies. 
Potential secondary reactions include Fischer-Tropsch-type 
(FTT) synthesis of alkanes from 
carbon monoxide (CO) (16) followed by aromatization; 
exchange with dissolved inorganic 
carbon (DIC) (17) or aqueous H2; or the breakdown 
of IOM into smaller organic molecules 
by catagenesis (18), which thermally cracks 
large organic molecules (on Earth this forms 
oil and gas deposits). 

Soluble PAHs have been studied in samples 
of carbonaceous chondrite (CC) meteorites, but 
because meteorite samples are exposed to the 
terrestrial environment, endogenous 
PAHs in meteorites could be contaminated 
with PAHs formed on Earth. Samples of the 
near-Earth carbonaceous asteroid (162173) 
Ryugu were collected by the Hayabusa2 spacecraft 
under controlled conditions and so had 
fewer opportunities for terrestrial 
contamination. We studied the isotopic properties 
of PAHs in the CC meteorite Murchison 
and a sample of Ryugu to investigate PAH 
formation processes. 


Principles of isotope analysis 


The rare-isotope distributions of organic molecules 
can be used to constrain the source, substrate, 
and chemical mechanisms responsible 
for their formation (19, 20). The molecular-average 
13C/12C and D/H ratios of organic molecules 
are conventionally reported as δ13C_VPDB 
and δD_VSMOW, respectively, where δ notation is 
defined in equation S1, and VPDB and VSMOW 
are standard reference scales (21). Measured 
isotope ratios of extraterrestrial organics are 
often outside the ranges of ratios measured in 
terrestrial organics. Some extraterrestrial organic 
molecules such as amino acids (22) are 
enriched in 13C and have been interpreted as 
products of precursor molecules that were 
themselves derived from a reservoir of CO in 
the ISM that was enriched in 13C (23, 24). By 
contrast, interstellar PAHs exhibit only small 
13C enrichments, consistent with formation in 
a range of extraterrestrial environments. 
Heterogeneities in the carbon isotope compositions 
of interstellar, circumstellar, and parent 
body carbon reservoirs prevent unique 
interpretation of such data (Fig. 1). 
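
As an illustration of the δ notation used above, the following minimal sketch converts an isotope ratio into a per-mille deviation from a reference ratio and back, using the conventional definition δ = (R_sample/R_standard − 1) × 1000. The function names are hypothetical, and the reference ratios are commonly cited values that are assumed here rather than taken from this article.

```python
# Commonly cited reference ratios (assumed here, not taken from the article).
R_VPDB_13C = 0.011180   # 13C/12C of the VPDB scale
R_VSMOW_D = 155.76e-6   # D/H of the VSMOW scale


def delta_permil(r_sample, r_standard):
    """Per-mille deviation of a sample isotope ratio from a standard ratio."""
    return (r_sample / r_standard - 1.0) * 1000.0


def ratio_from_delta(delta, r_standard):
    """Inverse of delta_permil: recover the isotope ratio from a delta value."""
    return r_standard * (1.0 + delta / 1000.0)


# A sample whose 13C/12C ratio is 1% above the standard has a delta of +10 per mille.
example_delta = delta_permil(1.01 * R_VPDB_13C, R_VPDB_13C)
```

Positive values indicate enrichment in the heavy isotope relative to the reference scale, and negative values indicate depletion.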

Multiple substitutions of heavy isotopes with- 
in molecules (hereafter referred to as clumping) 


Table 1. PAH abundance and isotope ratio measurements. δ13C_VPDB, δD_VSMOW, 2×13C/12C ratios, and Δ2×13C values of PAHs extracted from Ryugu samples 
A0106 and C0107, the Murchison (Murch) meteorite, and terrestrial standards. δ13C_VPDB, δD_VSMOW, and Δ2×13C values are reported in units of ‰. The number 
of replicate analyses, n, is included in parentheses (21). σ is the standard error propagated in quadrature (21). All uncertainties on the abundances of PAHs in Ryugu 
samples are 25% (21). 


[Table 1 body: the data rows could not be recovered from the extracted text. Column groups: Compound; Abundances of PAHs in Ryugu samples/ppm (Ryugu A0106, Ryugu C0107); δ13C_VPDB ± σ (n); δD_VSMOW ± σ (n); 2×13C/12C ± σ (n); Δ2×13C ± σ (n). Sample columns: Ryugu A0106, Ryugu C0107, Murch, Terrestrial standard.] 
Fig. 1. Potential pathways for extraterrestrial PAH formation. The central inset shows the molecular structures of the five PAHs we investigated. The surrounding panels schematically illustrate potential formation pathways for those PAHs. In (A), (C), and (D), grayscale color bars show δ¹³C_VPDB, δD_VSMOW, and Δ2×¹³C values measured or predicted in extraterrestrial materials. White is isotopically depleted and black is isotopically enriched. Dots and arrows indicate values and ranges, respectively, of the source carbon and hydrogen. Δ2×¹³C values are estimated predictions (21) based on model results (Fig. 2); see text. (A) PAH formation in hot (≥1000 K, red) circumstellar environments by molecular mass growth reactions (6, 11). δ¹³C_VPDB values in AGB stars are expected to range from 0 to hundreds of per mille depending on the stellar evolution (35, 36). δD_VSMOW values are expected to be low or zero as a result of the fusion of D in stars (35, 36). t_expulsion is the time scale for PAH expulsion from stellar envelopes. (B) Shock waves and ultraviolet radiation form PAHs by breaking down carbon-rich dust but also destroy them. t_breakdown is the time scale for PAH breakdown in the ISM. (C) PAH formation in cold (10 K, blue) interstellar environments through barrierless reactions (6, 12). Reduced carbon in molecular clouds is depleted in ¹³C compared with interstellar CO (47), whereas interstellar hydrogen is typically D-enriched (33). (D) PAH formation or modification on a parent body at moderate temperatures (100s of K, orange). Isotopic exchange can occur with carbon reservoirs such as CO and DIC, and hydrogen reservoirs such as H₂O. Murchison carbonate has δ¹³C_VPDB values of +20 to +80‰ (48), whereas the water in parent bodies of CC meteorites has been found to be D-depleted (40, 42).


reflect temperature-dependent chemical reactions and physical processes (19), thus providing additional information on a given molecule’s source and formation/degradation history. For any given molecular-average δ¹³C_VPDB value of PAHs, a smaller proportion of each molecule is doubly-¹³C substituted (denoted 2×¹³C). We report the ratio 2×¹³C/¹²C of clumped ¹³C isotopologues to the unsubstituted isotopologues as well as the differences, Δ2×¹³C, between the measured 2×¹³C/¹²C ratios and the expected stochastic 2×¹³C/¹²C ratios. Expected 2×¹³C/¹²C ratios are calculated based on the random probability of a double-¹³C substitution given its molecular-average ¹³C abundance (21). A formation process that leads to a higher (lower) number of heavy isotope substitutions than the stochastic expectation leads to a positive (negative) Δ2×¹³C value.
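The definitions above can be illustrated with a minimal Python sketch. It is not the authors' calculation (their method is given in their supplementary materials): it assumes that all n carbon positions of a molecule are equivalent and independently substituted, so that the stochastic ratio of doubly-¹³C to unsubstituted isotopologues is C(n, 2)·R², where R is the molecular-average ¹³C/¹²C ratio. The VPDB ratio constant and all function names are assumptions introduced for illustration.

```python
from math import comb

# Assumed 13C/12C ratio of the VPDB scale; published values differ slightly
# in the last digits, so treat this constant as illustrative only.
R_VPDB = 0.011180


def delta_permil(r_sample, r_standard):
    """Delta notation: relative deviation of a ratio from a standard, in per mille."""
    return (r_sample / r_standard - 1.0) * 1000.0


def ratio_from_delta(delta, r_standard):
    """Invert delta notation to recover the isotope ratio."""
    return r_standard * (1.0 + delta / 1000.0)


def stochastic_2x13c_ratio(r13, n_carbon):
    """Doubly-13C / unsubstituted isotopologue ratio for random substitution.

    Assumes n_carbon equivalent positions, each 13C with the same probability,
    which gives C(n, 2) * R**2 with R the molecular-average 13C/12C ratio.
    """
    return comb(n_carbon, 2) * r13 ** 2


def clumping_anomaly_permil(measured_ratio, r13, n_carbon):
    """Deviation of a measured 2x13C/12C ratio from the stochastic ratio, in per mille."""
    expected = stochastic_2x13c_ratio(r13, n_carbon)
    return (measured_ratio / expected - 1.0) * 1000.0


# Hypothetical example: a 10-carbon PAH (naphthalene) with an assumed
# molecular-average delta13C of -10 per mille.
r13 = ratio_from_delta(-10.0, R_VPDB)
expected = stochastic_2x13c_ratio(r13, 10)
```

A measured ratio 2% above `expected` would correspond to an anomaly of about +20‰ in this simplified picture, and a measured ratio equal to `expected` gives zero.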

Variations in 2×¹³C clumping can be temperature-dependent because heavy isotopic substitution lowers the molecular vibrational energy (fig. S4) of the C-C bond and therefore stabilizes it. Isotopic clumping is therefore more energetically favored at lower temperatures. We used acetylene as a model compound to predict 2×¹³C clumping at different temperatures, because it is thought to be a precursor to the formation of interstellar aromatic reduced carbon (25) including
Fig. 2. Predicted Δ2×¹³C values from our model. The black curve shows our theoretical calculations of the Δ2×¹³C values expected for acetylene at different temperatures (21). Labels below the plot indicate the relevant environments at each temperature and use the same colors as Fig. 1.


PAHs (26, 27) (Fig. 1A). The equilibrium exchange reaction between single and double-¹³C substitution in acetylene is:

2 ¹³C¹²CH₂ ⇌ ¹³C₂H₂ + ¹²C₂H₂    (1)

The products on the right-hand side of Eq. 1 are favored at all temperatures (Fig. 2) (27). We therefore predict that PAHs formed in the cold ISM have Δ2×¹³C values substantially different from zero (positive if formed by reversible reactions; Figs. 1C and 2). By contrast, we expect PAHs formed in hot circumstellar envelopes or in the parent body to have Δ2×¹³C close to zero (Figs. 1A and 2).


Isotopic measurements of PAH samples 


The Hayabusa2 spacecraft collected 5.4 g of material from two locations on Ryugu (28). Previous analysis of Ryugu samples has demonstrated that Ryugu is compositionally similar to Ivuna-type carbonaceous (CI) meteorites (29). Soluble organic molecules have previously been extracted (22, 30) from Ryugu samples A0106 and C0107, which are aggregates of sub-millimeter grains collected from the first and second touchdown sites, respectively (22, 28). Endogenous, complex soluble aromatic organics have been identified among the compounds extracted from these samples using the solvent dichloromethane (DCM) (30). The molecules identified included several PAHs: the 2-ring naphthalene, 3-ring isomers phenanthrene and anthracene, and 4-ring isomers fluoranthene and pyrene. The total PAH abundances in the Ryugu samples (8.6 to 32.4 nmol g⁻¹; Table 1 and fig. S2) were too low for isotopic analysis using gas-source isotope ratio mass spectrometry (22, 30).

We analyzed PAHs within the same DCM extracts of Ryugu as those previous studies (22, 30) using gas chromatography coupled with Orbitrap mass spectrometry (GC-Orbitrap) (31). We measured the δ¹³C_VPDB values of the five PAHs previously identified in Hayabusa2 sample A0106 and the δ¹³C_VPDB, δD_VSMOW, and Δ2×¹³C



values of the same five PAHs from C0107 (21). We compared these measurements with those of fluoranthene and pyrene extracted from the CC meteorite Murchison (21, 31). We also measured δ¹³C_VPDB and Δ2×¹³C values of combusted plant biomass as a high-temperature comparison (21). As a control sample, we used a serpentine blank processed in parallel with the Ryugu samples, and within this blank we detected no PAHs above the instrumental background (table S1). Measured isotope values are listed in Table 1 and plotted in Fig. 3.

We found elevated Δ2×¹³C values of 9 to 51‰ for naphthalene, fluoranthene, and pyrene from Ryugu and fluoranthene from Murchison, which significantly exceed the expected stochastic values (all P-values < 0.005) (27). The δ¹³C_VPDB values are the same (within analytical uncertainties) for PAHs with even and odd numbers of rings in Ryugu and for fluoranthene and pyrene in Murchison. δD_VSMOW values for naphthalene, fluoranthene, and pyrene from Ryugu are 320 to 570‰ higher than those of Ryugu’s phenanthrene and anthracene (Table 1).


Cold PAH formation processes 


The Δ2×¹³C values we measured for naphthalene, fluoranthene, and pyrene from Ryugu and fluoranthene from Murchison are larger than previously measured Δ2×¹³C values of ethane formed by processes at Earth-surface conditions (~1‰) (32). These large positive anomalies are consistent with the hypothesis that some (or all) of these PAHs were synthesized within cold (<50 K) environments (Figs. 1C and 3, A and C) by means of a process that preferentially forms ¹³C-¹³C bonds. The clumped-¹³C composition of all PAHs we measured is independent of molecular-average δ¹³C_VPDB values, which span a narrow range. This is consistent with the low-temperature process that formed these PAHs having a high yield, such that the overall ¹³C abundances of product PAHs approached those of the reactants. We expect PAHs formed within molecular clouds to have higher δD_VSMOW values than those of PAHs formed within other environments (Fig. 1C) (33) for the same chemical physics reasons that cause interstellar CO or acetylene to be enriched in ¹³C or ¹³C-clumping, respectively.

The highest Δ2×¹³C value we measure, 51 ± 13‰ for Murchison fluoranthene, is close to the acetylene equilibrium value at ~10 K, which we interpret as evidence that most of the fluoranthene in Murchison was synthesized in the ISM (34). We suggest that low-temperature reactions were responsible for both the initial formation of C-C bonds and the subsequent assembly of high-molecular-weight aromatic compounds from smaller molecules, because otherwise the assembly of larger structures would dilute the initial clumping. The Δ2×¹³C value of 4-ring fluoranthene from Murchison is higher than those of the 2- and 4-ring PAHs from Ryugu, which have positive but lower Δ2×¹³C values. We suggest three scenarios for Ryugu’s 2- and 4-ring PAHs: (i) They could have formed at higher (but still low) temperatures than Murchison’s fluoranthene, 20 to 50 K; (ii) they might be mixtures from two or more sources, with 20 to 50% formed in cold interstellar environments and the remainder formed (or reprocessed) at higher temperatures in other environments; (iii) they could have formed from 2- or 3-carbon precursors with large positive Δ2×¹³C values produced at low ISM temperatures, which were subsequently converted into larger aromatic molecules through reactions that diluted the initial Δ2×¹³C signature.
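Scenario (ii) can be made concrete with a simple two-end-member mass balance. The Python sketch below is an illustration only, not the authors' calculation: it assumes that Δ2×¹³C mixes linearly between a cold end-member and a hot end-member with Δ2×¹³C of zero, which is an approximation that is reasonable only when both end-members have similar molecular-average ¹³C abundances. The function name and the example input of 20‰ are hypothetical; the 51‰ cold end-member is the Murchison fluoranthene value quoted above, used here as an assumed stand-in for a purely interstellar signature.

```python
def cold_fraction(delta_mix, delta_cold, delta_hot=0.0):
    """Fraction of the cold end-member in a two-component mixture.

    Assumes linear mixing of the clumping anomaly (per mille):
        delta_mix = f * delta_cold + (1 - f) * delta_hot
    """
    if delta_cold == delta_hot:
        raise ValueError("end-members must have different anomalies")
    return (delta_mix - delta_hot) / (delta_cold - delta_hot)


# Hypothetical mixture with a 20 per mille anomaly, using 51 per mille
# (Murchison fluoranthene in the text) as the assumed cold end-member.
fraction = cold_fraction(20.0, 51.0)
```

Under these assumptions a mixture with an anomaly of 20‰ would contain a bit under 40% cold-formed material; a different choice of end-members changes the answer proportionally.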


Hot PAH formation processes 


Three of the extraterrestrial PAHs we examined have Δ2×¹³C values that are consistent with calculated stochastic values (P-values > 0.15) (27): the 3-ring PAHs phenanthrene and anthracene from Ryugu and 4-ring PAH pyrene from Murchison. We do not expect synthesis of PAHs at high temperatures to produce Δ2×¹³C values that can be distinguished from zero using our methods (27). We also measured no statistically significant (P-values > 0.04) deviations from the stochastic distributions of doubly-¹³C-substituted species in the PAHs from combusted plant biomass (Fig. 3, B and C, and table S3) (21).

High-temperature processes that could have 
formed the 3-ring PAHs from Ryugu and 
pyrene from Murchison include: (i) conden- 
sation in the outflows of carbon-rich AGB stars 
through bottom-up synthesis, such as the HACA 
mechanism, or (ii) top-down catagenetic break- 
down of larger carbonaceous dust grains. We ex- 
pect that the formation of aromatic compounds 
Fig. 3. ¹³C clumping measured from PAHs in the Ryugu samples. 2×¹³C/¹²C ratios are plotted as a function of δ¹³C_VPDB for: (A) the 2-ring PAH naphthalene (black square); (B) 3-ring PAHs phenanthrene (black cross) and anthracene (black circle); and (C) 4-ring PAHs fluoranthene (black triangle) and pyrene (black open crossed square). Black symbols were measured from Ryugu sample C0107; terrestrial standards (Std) are shown with unfilled gray symbols. (C) includes measured values for Murchison (Murch) meteorite (purple symbols). Panels B and C include residues of combusted plants (green symbols) (21, 49). Predicted stochastic distributions are plotted as dotted gray lines (21). Departures from stochasticity (Δ2×¹³C ± σ values) are indicated with labeled blue arrows (21). All error bars are 1σ and numerical values are listed in Table 1 and table S3. (D) δ¹³C_VPDB, δD_VSMOW, and Δ2×¹³C values measured for all five PAHs (see legend) in the Ryugu samples. Blue and orange ovals indicate PAHs that we interpret as formed by interstellar and circumstellar/parent body processes, respectively.
by high-temperature processes in circumstellar environments would produce PAHs with high δ¹³C_VPDB (35) and low δD_VSMOW values, due to the nuclear fusion of deuterium lowering its abundance, and the subsequent convective mixing of D-poor ashes from the core of the star to the outer layers (Fig. 1A) (36). This interpretation is consistent with our measurements of Ryugu phenanthrene and anthracene. A previous computational study of HACA formation of PAHs predicted an excess of the 3-ring PAHs phenanthrene and anthracene (compared with other PAHs with different numbers of rings) (37). Thus, PAHs formed by HACA could dominate the 3-ring species in Ryugu while making a small contribution to other PAH species.


Contributions from parent body processes 


PAHs in Ryugu or Murchison samples that have Δ2×¹³C values close to zero alternatively could have been formed or altered by secondary processes (Fig. 1D). Previous experiments that performed pyrolysis (reactions that simulate catagenesis) on meteoritic IOM formed the same 2-, 3-, and 4-ring PAHs that we found in A0106 and C0107, at temperatures experienced by Ryugu’s parent body (<200°C) (38). FTT synthesis is a less likely explanation for these samples because FTT products such as alkanes are unlikely to undergo aromatization below 200°C (39).

We expect PAHs that formed on the parent body or experienced secondary processing to have low δD_VSMOW values (Fig. 1D). A previous experiment demonstrated that organic molecules formed under hydrothermal conditions from CO have δD_VSMOW values of −450 to −580‰ (40), which is consistent with the values we measured for phenanthrene and anthracene from Ryugu (Fig. 3B). Hydrogen atoms in PAHs that were formed by primary interstellar processes could exchange isotopes and equilibrate with the parent body water (41). Ryugu is known to have experienced aqueous alteration (29). Some CC meteorites are known to contain water depleted in D [including Murchison, which has δD_VSMOW ~ −440‰ (42)], but the δD_VSMOW value of Ryugu parent body water is unknown. Carbon-rich grains in Ryugu samples have heterogeneous δD_VSMOW values, ranging from −1000‰ (no D detected) to +10,000‰ (40), indicating that parent body processes did not fully equilibrate hydrogen isotopic distributions in the Ryugu samples.

Formation of PAHs by means of synthesis in the parent body could explain the differences between the Δ2×¹³C values of fluoranthene and pyrene from Murchison. Murchison is known to have experienced aqueous alteration, which is variable even among different chips of the same specimen (43). Our measurements show that fluoranthene, which is the metastable isomer of the two four-ring PAHs that we studied (44), has Δ2×¹³C of 51 ± 13‰, which is consistent with preferential ¹³C-clumping at cold interstellar temperatures. Pyrene, the more stable isomer (44), has Δ2×¹³C values close to the stochastic distribution. Formation of PAHs under parent body conditions would be closer to thermodynamic equilibrium, favoring production of more stable isomers (pyrene is preferred to fluoranthene) (45). After mixing with preexisting PAHs formed through other pathways, the additional stable PAHs would dilute and potentially erase any preexisting interstellar isotopic fingerprints, such as ¹³C clumping (see Supplementary Text).


Conclusions 


We suggest that the diversity in δ¹³C_VPDB, δD_VSMOW, and Δ2×¹³C values measured in PAHs from Ryugu and Murchison is consistent with PAHs being formed in at least two different settings (27, 46). Most of the 2- and 4-ring PAHs were synthesized in cold interstellar environments whereas the other PAHs we measured (including the 3-ring PAHs) were formed by synthesis or reprocessing within high- to moderate-temperature settings, such as circumstellar environments or the parent body. The non-zero ¹³C clumping was measured in multiple PAHs from two samples that have different histories of alteration, collection, and (potentially) formation. By extrapolation, we infer that a large fraction of extraterrestrial PAHs were formed by interstellar processes.
Global medium-range weather forecasting is critical to decision-making across many social and economic domains. Traditional numerical weather prediction uses increased compute resources to improve forecast accuracy but does not directly use historical weather data to improve the underlying model. Here, we introduce GraphCast, a machine learning-based method trained directly from reanalysis data. It predicts hundreds of weather variables for the next 10 days at 0.25° resolution globally in under 1 minute. GraphCast significantly outperforms the most accurate operational deterministic systems on 90% of 1380 verification targets, and its forecasts support better severe event prediction, including tropical cyclone tracking, atmospheric rivers, and extreme temperatures. GraphCast is a key advance in accurate and efficient weather forecasting and helps realize the promise of machine learning for modeling complex dynamical systems.


It is 05:45 UTC (coordinated universal time) in mid-October 2022 in Bologna, Italy, at the recently opened high-performance computing facility of the European Centre for Medium-Range Weather Forecasts (ECMWF). For the past several hours, the Integrated Forecasting System (IFS) has been running sophisticated calculations to forecast Earth’s weather over the next days and weeks, and its first predictions have just begun to be disseminated to users. This process repeats every 6 hours, every day, to supply the world with the most accurate weather forecasts available.

The IFS, and modern weather forecasting 
more generally, are triumphs of science and 
engineering. The dynamics of weather sys- 
tems are among the most complex physical 
phenomena on Earth, and each day, countless 
decisions made by individuals, industries, and 
policy-makers depend on accurate weather fore- 
casts, from deciding whether to wear a jacket 
to deciding whether to flee a dangerous storm. 
The dominant approach for weather forecasting today is numerical weather prediction (NWP), which involves solving the governing equations of weather using supercomputers. The success of NWP lies in the rigorous and ongoing research practices that provide increasingly detailed descriptions of weather phenomena and in how well NWP scales to greater accuracy with greater computational resources (1, 2). As a result, the accuracy of weather forecasts has increased year after year, to the point where the path of a hurricane can be predicted many days ahead, a possibility that was unthinkable even a few decades ago.


But while traditional NWP scales well with 
compute, capitalizing on the vast amount of 
historical weather data to improve accuracy is 
not straightforward. Rather, NWP methods 
are improved by highly trained experts inno- 
vating better models, algorithms, and approx- 
imations, which can be a time-consuming and 
costly process. 

Machine learning-based weather prediction 
(MLWP)—wherein forecast models are trained 
from historical data, including observations and 
analysis data—offers an alternative to tradi- 
tional NWP. MLWP has the potential to im- 
prove forecast accuracy by capturing patterns 
in the data that are not easily represented in 
explicit equations. MLWP also offers oppor- 
tunities for greater efficiency by exploiting 
modern deep learning hardware, rather than 
supercomputers, and striking more favorable 
speed-accuracy trade-offs. Recently, MLWP 
has helped improve on NWP-based forecast- 
ing in regimes where traditional NWP is rela- 
tively weak, for example, in subseasonal heat 
wave prediction (3) and precipitation now- 
casting from radar images (4-7), where ac- 
curate equations and robust numerical methods 
are not as available. 

In medium-range weather forecasting—the 
prediction of atmospheric variables up to 10 days 
ahead—NWP-based systems such as the IFS 
are still most accurate. The top deterministic 
operational system in the world is ECMWF’s 
high-resolution forecast (HRES), a configuration 
of IFS that produces global 10-day forecasts 
at 0.1° latitude and longitude resolution, in 


Table 1. Weather variables and levels modeled by GraphCast. The numbers in parentheses in the column headings are the number of entries in the column. Boldfaced variables and levels indicate those that were included in the scorecard evaluation. All atmospheric variables are represented at each of the pressure levels.

Surface variables (5)
    2-m temperature (2T)
    10-m u wind component (10U)
    10-m v wind component (10V)
    Mean sea level pressure (MSL)
    Total precipitation (TP)

Atmospheric variables (6)
    Temperature (T)
    U component of wind (U)
    V component of wind (V)
    Geopotential (Z)
    Specific humidity (Q)
    Vertical wind speed (W)

Pressure levels (37)
    1, 2, 3, 5, 7, 10, 20, 30, 50, 70, 100, 125, 150, 175, 200, 225,
    250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 775,
    800, 825, 850, 875, 900, 925, 950, 975, and 1000 hPa
Fig. 1. Model schematic. (A) The input weather state(s) are defined on a 0.25° latitude-longitude grid comprising a total of 721 × 1440 = 1,038,240 points. Yellow layers in the close-up pop-out window represent the five surface variables, and blue layers represent the six atmospheric variables that are repeated at 37 pressure levels (5 + 6 × 37 = 227 variables per point in total), resulting in a state representation of 235,680,480 values. (B) GraphCast predicts the next state of the weather on the grid. (C) A forecast is made by iteratively applying GraphCast (GC) to each previous predicted state, to produce a sequence of states that represent the weather at successive lead times. (D) The encoder component of the GraphCast architecture maps local regions of the input (green boxes) into nodes of the multimesh graph representation (green, upward arrows that terminate in the green-blue node). (E) The processor component updates each multimesh node using learned message-passing (heavy blue arrows that terminate at a node). (F) The decoder component maps the processed multimesh features (purple nodes) back onto the grid representation (red, downward arrows that terminate at a red box). (G) The multimesh is derived from icosahedral meshes of increasing resolution, from the base mesh (M⁰, 12 nodes) to the finest resolution (M⁶, 40,962 nodes), which has uniform resolution across the globe. It contains the set of nodes from M⁶ and all the edges from M⁰ to M⁶. The learned message-passing over the different meshes’ edges happens simultaneously, so that each node is updated by all of its incoming edges. [The Earth texture in the figure is used under CC BY 4.0 from https://www.solarsystemscope.com/textures/]


around an hour (8). However, over the past 
several years, MLWP methods for medium- 
range forecasting trained on reanalysis data 
have been steadily advancing, facilitated by 
benchmarks such as WeatherBench (8). Deep 
learning architectures based on convolutional 
neural networks (9-11) and Transformers (12)
have shown promising results at latitude and 
longitude resolutions coarser than 1.0°, and 
recent works—which use graph neural networks 
(GNNs), Fourier neural operators, and Trans- 
formers (13-16)—have reported performance 
that begins to rival IFS’s at 1.0° and 0.25° for a 
handful of variables and lead times up to 7 days. 


GraphCast 


Here, we introduce an MLWP approach for 
global medium-range weather forecasting called 




GraphCast, which produces an accurate 10-day 
forecast in under a minute on a single Google 
Cloud TPU (Tensor Processing Unit) v4 device 
and supports applications including predict- 
ing tropical cyclone tracks, atmospheric rivers, 
and extreme temperatures. 

GraphCast takes as input the two most re- 
cent states of Earth’s weather—the current time 
and 6 hours earlier—and predicts the next state 
of the weather 6 hours ahead. A single weather 
state is represented by a 0.25° latitude-longitude 
grid (721 by 1440), which corresponds to rough- 
ly 28 km by 28 km resolution at the equator 
(Fig. 1A), where each grid point represents a set 
of surface and atmospheric variables (listed 
in Table 1). Like traditional NWP systems, 
GraphCast is autoregressive: It can be “rolled 
out” by feeding its own predictions back in as
input, to generate an arbitrarily long trajectory
of weather states (Fig. 1, B and C). 
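The autoregressive rollout described above can be sketched in a few lines of Python. This is a minimal illustration, not the GraphCast implementation: `rollout`, `toy_step`, and the list-of-floats `State` type are hypothetical stand-ins, and the toy step function is a simple linear extrapolation used only so that the loop can be run.

```python
from typing import Callable, List, Sequence

# Stand-in for a full gridded weather state (hypothetical simplification).
State = Sequence[float]

STEP_HOURS = 6                               # one model step, as stated in the text
STEPS_FOR_10_DAYS = 10 * 24 // STEP_HOURS    # number of steps in a 10-day forecast


def rollout(step_fn: Callable[[State, State], State],
            prev: State, curr: State, n_steps: int) -> List[State]:
    """Apply a two-input step function autoregressively.

    step_fn takes the state 6 hours earlier and the current state and returns
    the state 6 hours ahead; each prediction is fed back in as input.
    """
    trajectory: List[State] = []
    for _ in range(n_steps):
        nxt = step_fn(prev, curr)
        trajectory.append(nxt)
        prev, curr = curr, nxt   # slide the two-state input window forward
    return trajectory


def toy_step(prev: State, curr: State) -> State:
    """Placeholder dynamics (linear extrapolation); not a weather model."""
    return [2.0 * c - p for p, c in zip(prev, curr)]


forecast = rollout(toy_step, [0.0], [1.0], 3)
```

With the toy step function, `forecast` is `[[2.0], [3.0], [4.0]]`. A 10-day forecast at 6-hour steps corresponds to 40 applications of the step function.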

GraphCast is implemented as a neural net- 
work architecture, based on GNNs in an “encoder- 
processor-decoder” configuration (13, 17), with a 
total of 36.7 million parameters (code, weights, 
and demos can be found at https://github.com/deepmind/graphcast). Previous GNN-based
learned simulators (18-20) have been very
effective at learning the complex dynamics 
of fluid and other systems modeled by par- 
tial differential equations, which supports their 
suitability for modeling weather dynamics. 
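The multimesh on which this architecture operates (Fig. 1G) is built from successively refined icosahedral meshes, and the node counts quoted in the Fig. 1 caption can be checked with a short calculation. The Python sketch below is an illustration under stated assumptions, not code from the GraphCast repository: the function names are hypothetical, edges are counted as undirected, and edges from different refinement levels are assumed to be distinct (each refinement splits every edge at its midpoint, so a coarse edge does not reappear in a finer mesh).

```python
def icosahedral_mesh_counts(refinements):
    """Return (nodes, edges, faces) of an icosahedron refined `refinements` times.

    Each refinement splits every triangle into four, so faces and edges grow
    by a factor of four; nodes follow from Euler's formula V - E + F = 2.
    """
    faces = 20 * 4 ** refinements
    edges = 30 * 4 ** refinements
    nodes = edges - faces + 2
    return nodes, edges, faces


def multimesh_edge_count(max_refinement):
    """Undirected edges in the union of all meshes from level 0 to max_refinement."""
    return sum(icosahedral_mesh_counts(r)[1] for r in range(max_refinement + 1))


base_nodes = icosahedral_mesh_counts(0)[0]     # 12 nodes for the base mesh
finest_nodes = icosahedral_mesh_counts(6)[0]   # 40,962 nodes after six refinements
grid_points = 721 * 1440                       # 0.25-degree latitude-longitude grid
```

Under these assumptions the finest mesh has 40,962 nodes, matching the caption, and the ratio `grid_points / finest_nodes` is about 25.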

The encoder (Fig. 1D) uses a single GNN layer
to map variables (normalized to zero-mean unit 
variance) represented as node attributes on 
the input grid to learned node attributes on 
an internal “multimesh” representation. The 
Fig. 2. Global skill and skill scores for GraphCast a
(A) RMSE skill (y axis) for GraphCast (blue lines) and
Z500, as a function of lead time (x axis). Error bars ri
intervals. The vertical dashed line represents 3.5 days, which is the last 12-hour
increment of the HRES 06z/18z forecasts. The black line represents HRES, where
lead times earlier and later than 3.5 days are from th
initializations, respectively. (B) RMSE skill score (y axis)
HRES, on Z500, as a function of lead time (x axis). Error bars represent 95%
confidence intervals for the skill score. We observe a discontinuity in GraphCast's
curve because skill scores up to 3.5 days are comput
(initialized at 06z/18z) and HRES's 06z/18z initializati

multimesh (Fig. 1G) is a graph that is spatially 
homogeneous, with high spatial resolution 
over the globe. It is defined by refining a reg- 
ular icosahedron (12 nodes, 20 faces, 30 edges) 
iteratively six times, where each refinement 
divides each triangle into four smaller ones 
(leading to four times more faces and edges), 
and reprojecting the nodes onto the sphere. 
The multimesh contains the 40,962 nodes from 
the highest-resolution mesh (which is roughly 
1/25 the number of latitude-longitude grid 
points at 0.25°) and the union of all the edges 
created in the intermediate graphs, forming a 
flat hierarchy of edges with varying lengths. The 
processor (Fig. 1E) uses 16 unshared GNN layers 
to perform learned message-passing on the
multimesh, enabling efficient local and long-range
information propagation with few message- 
passing steps. The decoder (Fig. 1F) maps the 
final processor layer’s learned features from the 
multimesh representation back to the latitude- 
longitude grid. It uses a single GNN layer and 
predicts the output as a residual update to the 
most recent input state (with output normalization to achieve unit variance on the target residual). See supplementary materials section 3 for further architectural details.

During model development, we used 39 years (1979-2017) of historical data from ECMWF’s ERA5 (21) reanalysis archive. As a training objective, we averaged the mean squared error (MSE) between GraphCast’s predicted states over N autoregressive steps and the corresponding ERA5 states, with the error weighted by vertical level (see supplementary materials eq. 19). The value of N was increased incrementally from 1 to 12 (i.e., from 6 hours to 3 days) over the course of training, and the gradient of the loss was computed by backpropagation through time (22). GraphCast was trained to minimize the training objective using gradient descent, which took roughly 4 weeks on 32 Cloud TPU v4 devices using batch parallelism. See supplementary materials section 4 for further training details. Consistent with real deployment scenarios, where future information is not available for model development, we evaluated GraphCast on the held-out data from the years 2018 onward (see supplementary materials section 5.1).


Verification methods


We verified GraphCast’s forecast skill comprehensively by comparing its accuracy to that of
after 3.5 days are computed with respect to HRES's 00z/12z initializations.
(C) ACC skill (y axis) for GraphCast (blue lines) and HRES (black lines), on Z500,
as a function of lead time (x axis). (D) Scorecard of RMSE skill scores for
GraphCast, with respect to HRES. Each subplot corresponds to one variable: U, V,
Z, T, Q, 2T, 10U, 10V, and MSL. The rows of each heatmap correspond to the
13 pressure levels (for the atmospheric variables), from 50 hPa at the top
to 1000 hPa at the bottom. The columns of each heatmap correspond to the
20 lead times at 12-hour intervals, from 12 hours on the left to 10 days on the
right. Each cell's color represents the skill score, as shown in (B), where blue
represents negative values (GraphCast has better skill) and red represents
positive values (HRES has better skill).


HRES on a large number of variables, levels, 
and lead times. We quantified the respective 
skills of GraphCast, HRES, and ML baselines 
with two skill metrics: the root mean square 
error (RMSE) and the anomaly correlation co- 
efficient (ACC). 

Of the 227 variable and level combinations 
predicted by GraphCast at each grid point, we 
evaluated its skill versus HRES on 69 of them, 
corresponding to the 13 levels of WeatherBench 
(8) and variables (23) from the ECMWF Score- 
card (24); see boldface variables and levels in 
Table 1 and supplementary materials section 
1.2 for which HRES cycle was operational dur- 
ing the evaluation period. In addition to the 
aggregate performance reported in the main 
text, supplementary materials section 7 pro- 
vides further detailed evaluations, including 
other variables, precipitation, regional per- 
formance, latitude and pressure level effects, 
spectral properties, blurring, biases, compar- 
isons to other ML-based forecasts, and effects 
of model design choices. 
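The count of 69 evaluated variable and level combinations, and the 1380 verification targets mentioned in the abstract, can be reproduced from the variables, levels, and lead times listed in the Fig. 2 caption. The short Python check below is a sketch for illustration; the constant names are hypothetical.

```python
# Variables shown in the Fig. 2D scorecard, as listed in its caption.
ATMOSPHERIC_VARIABLES = ["U", "V", "Z", "T", "Q"]   # evaluated at each pressure level
SURFACE_VARIABLES = ["2T", "10U", "10V", "MSL"]     # single-level variables
N_PRESSURE_LEVELS = 13                               # WeatherBench levels, 50 to 1000 hPa

# 20 lead times at 12-hour intervals, from 12 hours to 10 days (240 hours).
LEAD_TIMES_HOURS = list(range(12, 241, 12))

n_combinations = len(ATMOSPHERIC_VARIABLES) * N_PRESSURE_LEVELS + len(SURFACE_VARIABLES)
n_targets = n_combinations * len(LEAD_TIMES_HOURS)
```

Here `n_combinations` evaluates to 69 and `n_targets` to 1380.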

In making these comparisons, skill was established on the basis of two key elements: (i) the selection of the ground truth for comparison and (ii) a careful accounting of the data assimilation windows used to infer this data from observations. We used ERA5 as the ground truth for evaluating GraphCast, because it was trained to take ERA5 data as input and predict ERA5 data as outputs. However, evaluating HRES forecasts against ERA5 would result in nonzero error on the initial forecast step. Instead, we constructed an “HRES forecast at step 0” (HRES-fc0) dataset to use as ground truth for HRES. HRES-fc0 contains the inputs to HRES forecasts at future initializations (see supplementary materials section 1.2), ensuring that each data point is grounded by recent observations and that the zeroth step of HRES forecasts will have zero error.

For a fair comparison, we had to ensure that 
the ERA5 initial conditions for GraphCast were
derived from assimilation windows that look
no further into the future than those used by
HRES. HRES initializations (00z/06z/12z/18z,
where 00z means 00:00 UTC in Zulu convention)
always assimilate observations 3 hours
into the future, whereas ERA5 initializations
assimilate observations 9 hours into the future 
at 00z/12z and 3 hours into the future at 06z/18z. 
This constrained the choice of initialization 
times for GraphCast to 06z/18z in all our re- 
sults. We used the same initializations for HRES 
when comparing performance up to 3.75 days. 
Beyond that, HRES archived forecasts are only 
available from 00z/12z initializations. The tran- 
sition from 06z/18z to 00z/12z initializations 
for HRES induced a small discontinuity in our 
plots, which is indicated by a vertical dashed
line at the appropriate lead time. Supplementary
materials section 5 contains further verification
details, including details of the comparison
protocol between GraphCast and HRES (supplementary
materials section 5.2) and the effect of
initialization lookahead on both models’ perfor- 
mance (supplementary materials section 5.2.2). 
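
The constraint on initialization times described above can be sketched as a simple lookup. The lookahead values are the ones stated in the text; the dictionary and function names are illustrative assumptions, not part of any released code.

```python
# Hours of future observations assimilated at each initialization time,
# as stated in the text. All names here are illustrative only.
HRES_LOOKAHEAD_HOURS = {"00z": 3, "06z": 3, "12z": 3, "18z": 3}
ERA5_LOOKAHEAD_HOURS = {"00z": 9, "06z": 3, "12z": 9, "18z": 3}


def fair_initializations(era5_lookahead, hres_lookahead):
    """Return init times where ERA5 looks no further ahead than HRES."""
    return [
        init_time
        for init_time in sorted(era5_lookahead)
        if era5_lookahead[init_time] <= hres_lookahead[init_time]
    ]


fair_inits = fair_initializations(ERA5_LOOKAHEAD_HOURS, HRES_LOOKAHEAD_HOURS)
print(fair_inits)  # ['06z', '18z']
```

Only the 06z and 18z initializations pass the check, which matches the choice stated in the text.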


Forecast verification results 


We find that GraphCast has greater weather 
forecasting skill than HRES when evaluated on 
10-day forecasts at a horizontal resolution of 
0.25° for latitude and longitude and at 13 vertical 
levels. Figure 2, A to C, shows how GraphCast 
(blue lines) outperforms HRES (black lines) on 
the Z500 (geopotential at 500 hPa) “headline” 
field in terms of RMSE skill, RMSE skill score 
[i.e., the normalized RMSE difference between
model A and baseline B defined as (RMSE_A −
RMSE_B)/RMSE_B], and ACC skill. Using Z500,
which encodes the synoptic-scale pressure dis- 
tribution, is common in the literature, as it has 
strong meteorological importance (8). The plots 
show that GraphCast has better skill scores 
across all lead times, with a skill score improve- 
ment of around 7 to 14%. Plots for additional 
headline variables are given in supplementary 
materials section 7.1. 
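
As a minimal sketch of the skill score defined above, the following computes (RMSE_A − RMSE_B)/RMSE_B on toy arrays. The unweighted RMSE, the function names, and the numbers are assumptions made for illustration; the evaluation in the paper follows the procedure in the supplementary materials.

```python
import numpy as np


def rmse(forecast, truth):
    """Unweighted root mean square error (a simplification for illustration)."""
    forecast = np.asarray(forecast, dtype=float)
    truth = np.asarray(truth, dtype=float)
    return float(np.sqrt(np.mean((forecast - truth) ** 2)))


def rmse_skill_score(rmse_model, rmse_baseline):
    """Normalized RMSE difference; negative values mean model A beats baseline B."""
    return (rmse_model - rmse_baseline) / rmse_baseline


# Toy values, not taken from the paper.
truth = np.zeros(4)
model_a = np.array([0.9, -0.9, 0.9, -0.9])     # RMSE of about 0.9
baseline_b = np.array([1.0, -1.0, 1.0, -1.0])  # RMSE of 1.0

score = rmse_skill_score(rmse(model_a, truth), rmse(baseline_b, truth))
print(f"skill score: {score:.3f}")
```

With these toy inputs the score is about −0.1, that is, a 10% lower RMSE for model A than for baseline B.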

Figure 2D summarizes the RMSE skill scores 
for all 1380 evaluated variables and pressure
levels, across the 10-day forecasts, in a format
analogous to the ECMWF Scorecard. The cell
colors are proportional to the skill score, where
blue indicates that GraphCast had better skill,
and red indicates that HRES had better skill.
GraphCast outperformed HRES on 90.3% of the
1380 targets and significantly (P < 0.05, nominal
sample size n ∈ {729, 730}) outperformed HRES
on 89.9% of targets. See supplementary materials
section 5.4 for methodology and table
S4 for P values, test statistics, and effective sample
sizes.

Fig. 3. Severe event prediction. (A) Cyclone tracking performances for GraphCast and HRES. The x axis
represents lead times (in days), and the y axis represents median track error (in kilometers). Error bars
represent bootstrapped 95% confidence intervals for the median. (B) Cyclone tracking paired error difference
between GraphCast and HRES. The x axis represents lead times (in days), and the y axis represents median
paired error difference (in kilometers). Error bars represent bootstrapped 95% confidence intervals for the median
difference (see supplementary materials section 8.1). (C) Atmospheric river prediction (IVT) skills for GraphCast
and HRES. The x axis represents lead times (in days), and the y axis represents RMSE. Error bars are 95%
confidence intervals. (D) Extreme heat prediction precision-recall for GraphCast and HRES. The x axis represents
recall, and the y axis represents precision. The curves represent different precision-recall trade-offs when
sweeping over gain applied to forecast signals (see supplementary materials section 8.3).

The regions of the atmosphere in which 
HRES had better performance than GraphCast 
(top rows in red in the scorecards) were dis- 
proportionately localized in the stratosphere 
and had the lowest training loss weight (see 
supplementary materials section 7.2.2). When 
excluding the 50 hPa level, GraphCast signif- 
icantly outperforms HRES on 96.9% of the re- 
maining 1280 targets. When excluding levels 
50 and 100 hPa, GraphCast significantly outper- 
forms HRES on 99.7% of the remaining 1180 
targets. When conducting per-region evaluations,
we found that the previous results generally
hold across the globe, as detailed in figs.
S14 to S16.

We found that increasing the number of 
autoregressive steps in the MSE loss improves 
GraphCast performance at longer lead times 
(see supplementary materials section 7.3.2). It 
also encourages GraphCast to blur to a degree 
at longer lead times (see fig. S38), which means 
that its forecasts will lie somewhere between a 
traditional deterministic forecast and an ensem- 
ble mean. HRES’s underlying physical equations, 
however, do not lead to blurred predictions. To 
assess whether GraphCast’s relative advantage 
over HRES on RMSE skill is due to blurrier 
forecasts better optimizing RMSE, we artifi- 
cially blurred HRES’s forecasts with blurring 
filters. We fit filters for GraphCast and HRES 
by minimizing the RMSE between filtered pre- 
dictions and the models’ respective ground 
truths. We found that RMSE-optimized blur- 
ring applied to GraphCast has greater skill 
than analogous blurring applied to HRES on 
88.0% of our 1380 verification targets, which is 
generally consistent with our above conclusions
(see supplementary materials section 7.4).
Still, blurrier forecasts may not be desirable for
some applications, which we discuss further
in the Conclusions section. We also compared
GraphCast’s performance to the top competing
ML-based weather model, Pangu-Weather (6),
and found that GraphCast outperformed it on
99.2% of the 252 targets presented (see supplementary
materials section 6 for details).

Fig. 4. Training GraphCast on more recent data. Each colored line represents
GraphCast trained with data ending before a different year, from 2018
(blue) to 2021 (purple). The y axis represents RMSE skill scores on 2021
test data, for Z500, with respect to GraphCast trained up to before 2018,
over lead times (x axis). The vertical dashed line represents 3.5 days, where
the HRES 06z/18z forecasts end. The black line represents HRES, where lead
times earlier and later than 3.5 days are from the 06z/18z and 00z/12z
initializations, respectively.


Severe event forecasting results 


Beyond evaluating GraphCast’s forecast skill 
against HRES’s on a wide range of variables and 
lead times, we also evaluated how its forecasts 
support predicting severe events, including tropical
cyclone tracks, atmospheric rivers, and extreme
temperature. These are key downstream
applications for which GraphCast is not specif- 
ically trained, but which are very important for 
human activity. 


Tropical cyclone tracks 


Improving the accuracy of tropical cyclone 
tracking can help avoid injury and loss of life 
and can reduce economic harm (25). A cy- 
clone’s existence and trajectory are predicted 
by applying a tracking algorithm to forecasts 
of geopotential (Z), horizontal wind (10U/10V, 
U/V), and mean sea level pressure (MSL). We 
implemented a tracking algorithm based on 
ECMWF’s published protocols (26) and ap- 
plied it to GraphCast’s forecasts to produce 
cyclone track predictions (see supplementary 
materials section 8.1). As a baseline for com- 
parison, we used the operational tracks obtained 
from HRES’s 0.1° forecasts with ECMWF’s own 
tracker, stored in the TIGGE (THORPEX Inter- 
active Grand Global Ensemble) archive (27, 28). 
We used different trackers for GraphCast and
HRES because no single tracker is guaranteed
to perform best across different forecasts; we
therefore used the best tracker for each model.
We measured errors for both models against the tracks
from IBTrACS [International Best Track Archive
for Climate Stewardship (29, 30)], a separate
reanalysis dataset of cyclone tracks aggregated
from various analysis and observational sources.
To be consistent with established evaluation of
tropical cyclone prediction (26), we evaluate 
all tracks when both GraphCast and HRES 
detect a cyclone, ensuring that both models are 
evaluated on the same events, and verify that 
each model’s true-positive rates are similar. 

Figure 3A shows that GraphCast has lower 
median track error than HRES over the period 
2018-2021 (median was chosen to resist outliers). 
As per-track errors for HRES and GraphCast 
are correlated, we also measured the per-track 
paired error difference between the two mod- 
els and found that GraphCast is significantly 
better than HRES for lead times of 18 hours to 
4.75 days, as shown in Fig. 3B. The error bars 
show the bootstrapped 95% confidence inter- 
vals for the median (see supplementary mate- 
rials section 8.1 for details). 
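
The paired comparison can be illustrated with a plain percentile bootstrap of the median paired difference. This is a sketch under stated assumptions: the track errors below are synthetic, the function name and resample count are hypothetical, and the exact procedure used for Fig. 3B is the one given in supplementary materials section 8.1.

```python
import numpy as np


def bootstrap_median_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for the median of `values`."""
    rng = np.random.default_rng(seed)
    values = np.asarray(values, dtype=float)
    # Resample with replacement: one row of indices per bootstrap replicate.
    idx = rng.integers(0, len(values), size=(n_resamples, len(values)))
    medians = np.median(values[idx], axis=1)
    lo, hi = np.quantile(medians, [alpha / 2, 1 - alpha / 2])
    return float(np.median(values)), float(lo), float(hi)


# Synthetic per-track errors in km (not data from the paper).
rng = np.random.default_rng(1)
err_hres = rng.gamma(shape=2.0, scale=50.0, size=200)
err_graphcast = err_hres - rng.normal(loc=10.0, scale=5.0, size=200)

# Pairing removes the variation shared by both models on the same track.
paired_diff = err_graphcast - err_hres
median_diff, ci_lo, ci_hi = bootstrap_median_ci(paired_diff)
print(f"median paired difference: {median_diff:.1f} km "
      f"(95% CI {ci_lo:.1f} to {ci_hi:.1f})")
```

An interval that lies entirely below zero corresponds to the first model having significantly lower track error at that lead time.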


Atmospheric rivers 


Atmospheric rivers are narrow regions of the 
atmosphere that are responsible for most of 
the poleward water vapor transport across the 
mid-latitudes and generate 30 to 65% of annual
precipitation on the US West Coast (31).
Their strength can be characterized by the ver- 
tically integrated water vapor transport IVT 
(32, 33), indicating whether an event will pro- 
vide beneficial precipitation or be associated 
with catastrophic damage (34). IVT can be computed
from the nonlinear combination of the
horizontal wind speed (U and V) and specific 
humidity (Q), which GraphCast predicts. We 
evaluated GraphCast forecasts over coastal North 
America and the Eastern Pacific during cold 
months (October to April), when atmospheric 
rivers are most frequent. Despite not being 
specifically trained to characterize atmospheric 
rivers, Fig. 3C shows that GraphCast improves
the prediction of IVT compared with HRES,
with gains ranging from 25% at short lead times to 10% at longer
horizons (see supplementary materials sec- 
tion 8.2 for details). 
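
IVT is commonly defined as the magnitude of the pressure-integrated moisture flux, IVT = (1/g) sqrt[(∫ q u dp)² + (∫ q v dp)²]. The sketch below evaluates this for a single column with the trapezoid rule. The integration bounds (300 to 1000 hPa), the toy profile, and the function names are assumptions for illustration; supplementary materials section 8.2 gives the definition used in the evaluation.

```python
import numpy as np

G = 9.80665  # standard gravity in m s^-2


def trapezoid(y, x):
    """Trapezoid-rule integral of samples y over coordinates x."""
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))


def ivt(q, u, v, pressure_pa):
    """Vertically integrated water vapor transport in kg m^-1 s^-1.

    q: specific humidity (kg/kg); u, v: wind components (m/s);
    pressure_pa: pressure levels in Pa, ordered from top to surface.
    """
    q, u, v = (np.asarray(a, dtype=float) for a in (q, u, v))
    flux_u = trapezoid(q * u, pressure_pa)
    flux_v = trapezoid(q * v, pressure_pa)
    return float(np.hypot(flux_u, flux_v)) / G


# Toy column (assumed values): uniform humidity and purely zonal wind.
levels = np.array([30000.0, 50000.0, 70000.0, 85000.0, 100000.0])
q = np.full(5, 0.005)
u = np.full(5, 20.0)
v = np.zeros(5)

column_ivt = ivt(q, u, v, levels)
print(f"IVT = {column_ivt:.1f} kg m^-1 s^-1")
```

Because the humidity and wind enter as a product inside the integral and the two components are combined in quadrature, IVT is a nonlinear function of the predicted fields, as noted in the text.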


Extreme heat and cold 


Extreme heat and cold are characterized by 
large anomalies with respect to typical climatology
(3, 35, 36), which can be dangerous and
disrupt human activities. We evaluated the 
skill of HRES and GraphCast in predicting 
events above the top 2% of historical values 
across location, time of day, and month of the 
year, for 2T at 12-hour, 5-day, and 10-day lead 
times, for land regions across the Northern 
and Southern hemispheres over their respec- 
tive summer months. We plotted precision- 
recall curves (37) to reflect different possible 
trade-offs between reducing false positives 
(high precision) and reducing false negatives
(high recall). For each forecast, we obtained the 
curve by varying a “gain” parameter that scales 
the 2T forecast’s deviations with respect to the 
median climatology. 
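
The gain sweep can be sketched as follows: the forecast's deviation from the median climatology is multiplied by a gain, the scaled value is compared with the extreme threshold, and precision and recall are computed for each gain. The function name, the single shared threshold, and the toy temperatures are assumptions for illustration; the full procedure is described in supplementary materials section 8.3.

```python
import numpy as np


def precision_recall_for_gain(forecast, clim_median, threshold, observed_extreme, gain):
    """Precision and recall of extreme-event calls after scaling forecast deviations."""
    forecast = np.asarray(forecast, dtype=float)
    observed_extreme = np.asarray(observed_extreme, dtype=bool)
    scaled = clim_median + gain * (forecast - clim_median)
    predicted = scaled > threshold
    tp = int(np.sum(predicted & observed_extreme))
    fp = int(np.sum(predicted & ~observed_extreme))
    fn = int(np.sum(~predicted & observed_extreme))
    precision = tp / (tp + fp) if (tp + fp) > 0 else float("nan")
    recall = tp / (tp + fn) if (tp + fn) > 0 else float("nan")
    return precision, recall


# Toy 2T values in kelvin (assumed, not from the paper).
forecast = np.array([300.0, 304.0, 306.0, 309.0, 311.0])
observed_extreme = np.array([False, False, True, True, True])
clim_median, threshold = 300.0, 308.0

curve = [
    precision_recall_for_gain(forecast, clim_median, threshold, observed_extreme, g)
    for g in (1.0, 2.0, 3.0)
]
for gain, (p, r) in zip((1.0, 2.0, 3.0), curve):
    print(f"gain={gain:.0f}: precision={p:.2f}, recall={r:.2f}")
```

In this toy case a larger gain raises recall at the eventual cost of precision, which is the trade-off traced out by each curve.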

Figure 3D shows that GraphCast’s precision- 
recall curves are above HRES’s for 5- and 10-day 
lead times, suggesting that GraphCast’s fore- 
casts are generally superior to those of HRES 
at extreme classification over longer horizons. 
By contrast, HRES has better precision-recall at 
the 12-hour lead time, which is consistent with 
the 2T skill score of GraphCast over HRES being 
near zero, as shown in Fig. 2D. We generally find 
these results to be consistent across other variables
relevant to extreme heat, such as T850
(temperature at 850 hPa) and Z500 (geopotential
at 500 hPa) (36), other extreme thresholds (5, 
2, and 0.5%), and extreme cold forecasting in 
winter. See supplementary materials section 
8.3 for details. 


Effect of training data recency 


GraphCast can be retrained periodically with 
recent data, which in principle allows it to cap- 
ture weather patterns that change over time— 
in response to, for example, the effects of cli- 
mate change—and long climate oscillations. 
We trained four variants of GraphCast from 
scratch, with data that always began in 1979 
but ended in 2017, 2018, 2019, and 2020, re- 
spectively (we label the variant ending in 2017 
as “GraphCast: <2018,” etc.). We compared their 
performances to HRES on 2021 test data. 

Figure 4 shows the skill scores (normalized 
by GraphCast:<2018) of the four variants and 
HRES for Z500. We found that while GraphCast’s
performance when trained up to before 2018 is 
still competitive with HRES in 2021, training it 
up to before 2021 further improves its skill scores 
(see supplementary materials section 7.1.3). 
We speculate that this recency effect allows re- 
cent weather trends to be captured to improve 
accuracy. This shows that GraphCast’s per- 
formance can be improved by retraining on 
more recent data. 


Conclusions 

GraphCast’s forecast skill and efficiency compared
with HRES show that MLWP methods
are now competitive with traditional weather
forecasting methods. Additionally, GraphCast’s
performance on severe event forecasting, which
it was not directly trained for, demonstrates
its robustness and potential for downstream 
value. We believe this marks a turning point 
in weather forecasting, which helps open new 
avenues to strengthen the breadth of weather- 
dependent decision-making by individuals and 
industries by making cheap prediction more 
accurate and accessible as well as suitable for 
specific applications. 

With 36.7 million parameters, GraphCast is 
a relatively small model by modern ML stan- 
dards, chosen to keep the memory footprint 
tractable. And while HRES is released on 0.1° 
resolution, 137 levels, and up to 1-hour time 
steps, GraphCast operates on 0.25° latitude 
and longitude resolution, 37 vertical levels, 
and 6-hour time steps, because of the ERA5 
training data’s native 0.25° resolution and en- 
gineering challenges in fitting higher-resolution 
data on hardware. Generally, GraphCast should 
be viewed as a family of models, with the cur- 
rent version being the largest we can practi- 
cally fit under current engineering constraints, 
but which have the potential to scale much 
further in the future with greater compute 
resources and higher-resolution data. 

One key limitation of our approach is in how 
uncertainty is handled. We focused on de- 
terministic forecasts and compared against 
HRES, but the other pillar of ECMWF’s IFS, 
the ensemble forecasting system ENS, is es- 
pecially important for quantifying the prob- 
ability of extreme events and as the skill of the 
forecast decreases at longer lead times. The 
nonlinearity of weather dynamics means that 
there is increasing uncertainty at longer lead 
times, which is not well captured by a single 
deterministic forecast. ENS addresses this by 
generating multiple, stochastic forecasts, which 
approximate a predictive distribution over fu- 
ture weather; however, generating multiple 
forecasts is expensive. By contrast, GraphCast’s 
MSE training objective encourages it to spa- 
tially blur its predictions in the presence of 
uncertainty, which may not be desirable for 
some applications where knowing tail, or joint, 
probabilities of events is important. Building 
probabilistic forecasts that model uncertainty 
more explicitly, along the lines of ensemble 
forecasts, is a crucial next step. 

It is important to emphasize that data-driven 
MLWP relies critically on large quantities of data 
and their quality, which, in the case of models 
trained on reanalysis, depend on the fidelity of 
NWPs. Therefore, rich high-quality data sources 
such as ECMWF’s MARS (Meteorological Archi- 
val and Retrieval System) archive (38) are inval- 
uable. Our approach should not be regarded 
as a replacement for traditional weather fore- 
casting methods, which have been developed 
for decades, rigorously tested in many real-world 
contexts, and offer many features we have not 
yet explored. Rather, our work should be interpreted
as evidence that MLWP is able to meet
the challenges of real-world forecasting problems
and has the potential to complement and
improve the current best methods. 

Beyond weather forecasting, GraphCast can 
open new directions for other important geo- 
spatiotemporal forecasting problems, includ- 
ing climate and ecology, energy, agriculture, 
and human and biological activity, as well as 
other complex dynamical systems. We believe 
that learned simulators, trained on rich, real- 
world data, will be crucial in advancing the role 
of machine learning in the physical sciences.


SUPERCONDUCTIVITY


Time-reversal symmetry breaking superconductivity 
between twisted cuprate superconductors 
S. Y. Frank Zhao, Xiaomeng Cui, Pavel A. Volkov, Hyobin Yoo, Sangmin Lee,
Jules A. Gardener, Austin J. Akey, Rebecca Engelke, Yuval Ronen, Ruidan Zhong,
Genda Gu, Stephan Plugge, Tarun Tummuru, Miyoung Kim, Marcel Franz,
Jedediah H. Pixley, Nicola Poccia, Philip Kim


Twisted interfaces between stacked van der Waals (vdW) cuprate crystals present a platform for 
engineering superconducting order parameters by adjusting stacking angles. Using a cryogenic 
assembly technique, we construct twisted vdW Josephson junctions (JJs) at atomically sharp 
interfaces between Bi2Sr2CaCu2O8+x crystals, with quality approaching the limit set by intrinsic JJs.
Near 45° twist angle, we observe fractional Shapiro steps and Fraunhofer patterns, consistent with the 
existence of two degenerate Josephson ground states related by time-reversal symmetry (TRS). 

By programming the JJ current bias sequence, we controllably break TRS to place the JJ into either of 
the two ground states, realizing reversible Josephson diodes without external magnetic fields. Our 
results open a path to engineering topological devices at higher temperatures. 


Weak van der Waals (vdW) bonding between
neighboring atomic layers offers
a distinct opportunity for engineering
atomic interfaces with controlled
twist angles (1). Careful adjustment of
the twist angle can create the spatial periodicity
of a moiré superlattice (2) with narrow
electronic bands and topological structure
(3). Such twisted structures of various vdW
materials, including graphene (3) and tran- 
sition metal dichalcogenides (4), have been 
shown to host a plethora of emergent elec- 
tronic states, including superconductivity (5), 
magnetism (6), Chern insulators (7), general- 
ized electronic Wigner crystals (8), and cor- 
related insulating states (9). 

Atomically layered cuprate high-temperature 
superconductors also offer a platform for 
twistronics by enabling the engineering of 
the coupling between nodal superconduct- 
ing order parameters (SOPs) across a twisted 
vdW interface (10-15). In Bi2Sr2CaCu2O8+x
(BSCCO), superconducting CuO2 bilayers are
Josephson-coupled through insulating [SrO-BiO]
bilayers (16), where the crystal can be
mechanically cleaved into atomically flat crystals
(17, 18) exhibiting high-temperature superconductivity
even in the monolayer limit (19).

Josephson junctions (JJs) formed at twist 
boundaries between cuprate crystals directly 
probe the pairing symmetry of Cooper pairs 
(20, 21). For example, using ab-planar tilt grain 
boundaries, measurements of angle-dependent 
critical currents (22), nonsinusoidal Josephson 
current-phase relations (23), and spontaneous
supercurrents (21) established the cuprates
as d-wave superconductors where the SOP
changes sign when Cooper pair momentum is 
rotated by 90°. 

Similarly, interfacial Josephson coupling be- 
tween twisted nodal d-wave superconductors 
is strongly modulated by the twist angle (15). 
Particularly, at a twist angle of θ = 45°, direct
Cooper pair tunneling is forbidden because of
the complete mismatch between the d_{x²−y²}
symmetric SOPs across the interface (15). Any
remaining Josephson supercurrents are mediated
through second-order processes (10-12, 24);
the system is expected to support topological
time-reversal symmetry (TRS)-breaking superconducting
phases persisting up to the junction
superconducting transition temperature Tc
(11, 12). Alternatively, TRS can be broken away
from θ = 45° by an applied current, which is
predicted to induce a topological superconduct- 
ing state (13). 


Building twisted BSCCO junctions 


The preservation of surface superconductivity 
of BSCCO crystals after vdW stacking remains 
an outstanding experimental challenge (17, 18).
BSCCO crystals react with moisture (19, 25), and 
their oxygen dopants become mobile above 
200 K (9, 26). BSCCO twist junctions previous- 
ly required high-temperature oxygen anneal- 
ing to restore interfacial superconductivity 
(27-31), often at the cost of considerable interfacial
structural reconstruction (29, 31-33), suppressed
Tc (28-31), low critical current density
Jc at zero twist angle (27-29, 31), and pronounced
anomalies in the resistance-temperature
(R-T) curves (28, 29, 31). Most experiments observed
no angular sensitivity (27-29), except two
(30, 31) where the Josephson coupling angular 
dependence deviated strongly from conven- 
tional models of d-wave superconductivity (15). 

We overcome these challenges by developing 
a cryogenic, solvent-free vdW transfer tech- 
nique in pure argon using a liquid nitrogen- 
cooled stage kept at < −90°C. We cleave an
exfoliated BSCCO crystal into two copies be- 
tween BiO planes, while thermally freezing 
out oxygen migration and other chemical pro- 
cesses at the surface (Fig. 1A and fig. S1). One 
of the crystals is quickly rotated to the targeted 
twist angle and reassembled with the other. A 
Josephson junction forms in the overlapping 
region upon contact (Fig. 1, B to D). Two sets of 
electrical contacts, both defined by stencil masks 
and evaporated on a −30°C cold stage (17),
are prefabricated nearby before cleaving and 
placed on the top surface of the bottom crystal 
after reassembly. This contact geometry probes 
the twist junction while minimizing bulk crys- 
tal contributions (Fig. 1H, upper inset). Addi- 
tional details are given in section S1 of (34). 

Cryogenic handling in argon is critical to 
maintaining pristine atomic interfaces without 
interfacial reconstruction and oxygen dopant 
changes. Figure 1D shows a cross-sectional high-angle
annular dark field (HAADF) scanning
transmission electron microscope (STEM) image
of a θ = 46° junction, simultaneously viewing
both BSCCO crystals on two different zone
axes. Notably, crystalline order is maintained
at the interfacial layer with no change to the
lattice structure (35) in the [001] direction between
atomic layers (Fig. 1E) or along CuO2
planes (Fig. 1, F and G, and figs. S2 to S4). Detailed
analysis of the interfacial BiO atomic
layer places an upper bound on the twist-angle
disorder of 0.2° [section S2 of (34)]. Similar
interface quality is uniformly observed across
large areas in several additional junctions (see
figs. S2 to S4, for example). We compare 26 devices
with θ ranging from 0° to 180° with Tc >
79 K and average Tc of 84 K (Fig. 1H, lower
inset), demonstrating high oxygen dopant uniformity
even at the junction. Neither Tc nor
the normal-state conductivity is systematically
correlated with θ (fig. S8).

At θ = 0°, our devices exhibit electronic characteristics
similar to the intrinsic Josephson junctions
(IJJs) that naturally form between CuO2
layers in BSCCO single crystals (16), demonstrating
the high interfacial quality of our JJs. Figure 1I
shows the current-voltage (I-V) characteristics
(IVCs) measured with four terminals at temperature
T = 9 K. In this low-temperature regime,
the JJ exhibits a large hysteresis. As we
increase current bias from a large negative value,
the resistive quasiparticle branch of the IVC
undergoes a series of jumps (green arrows) before
retrapping into the superconducting state.
Similar jumps were seen in IJJs [see section S5
of (34)] (16). Under positive bias current, V first
becomes nonzero at Ic1 (Fig. 1I, inset) before
jumping about 20 mV to the resistive state at
the critical current Ic, marked by the blue triangle.
The small voltage rise at Ic1 was previously
seen in some IJJs (16, 36, 37) and was
attributed to self-field effects (16, 38), whereas
the V jump at Ic is comparable in magnitude
to the corresponding jump in IJJs (16, 39, 40).
Upon reversing the bias current polarity (dashed
line), the JJ’s I-V behavior is mirrored along
I = 0. Normalizing to junction area, we obtain
a critical current density Jc = 12 kA/cm² for
this junction, similar to the Jc of intrinsic junctions
(41). Together, these observations indicate
that our θ = 0° JJ reaches electronic quality
comparable to IJJs of single-crystal BSCCO.

Fig. 1. Twist Josephson junctions with intrinsic junction quality.
(A) Schematic of key fabrication step: a single BSCCO crystal is cleaved using
polydimethylsiloxane (PDMS) below −90°C. (B) Optical micrograph of a BSCCO
twist junction. Dashes outline identical copies of the same crystal. Corresponding
schematic in upper inset of (H). (C) Atomic force microscope topography
showing atomically flat interface. Line trace shows topography along dotted line.
(D) Cross-sectional annular dark field scanning TEM image of θ = 46° junction,
showing bulk-like crystalline order at the interface. Bright spots are columns
of atoms identified in the upper right corner. (E) Integrated intensity of each
layer. arb, arbitrary units. (F) Fourier transform of TEM line cuts at BiO layers
highlighted in (E), showing atomically sharp structural transition at the interface.
(G) BiO layer lattice periodicity variation, δk/k, and peak full width at half
maximum (FWHM) from (F), whose fluctuations are within 0.05%, with peak
FWHM at fast Fourier transform resolution (0.014 nm⁻¹). (H) In-plane
resistance in each bulk crystal versus resistance through the artificial junction
between them, showing nearly identical junction Tc. Lower inset shows Tc
distribution among all 26 JJs in the angle-dependence analysis. (I) I-V curve for a
θ = 0° junction in both sweep directions (arrows). Blue triangle highlights Jc
comparable to IJJs. Green triangles highlight inelastic scattering features seen
at the same voltages in IJJs (16). Inset shows low-voltage region of IVC.


Twist angle and temperature-dependent
transport


To compare transport characteristics of different
twisted JJs, we normalized the bias current
I with the junction normal resistance
RN [see section S4 of (34) for experimental definition].
Because Ic is proportional to the junction area
and RN is inversely proportional to it, the product
IcRN is independent of junction area. Figure
2A shows the normalized dynamic resistance
[dV/dI]/RN as a function of T and IRN at θ = 0°,
31° and 44.9° (similar data for 26 JJs studied
are shown in fig. S11). Several features are apparent
in these datasets. First, as the current
sweeps from left to right, on the retrapping side 
(Ry < 0), constant-voltage inelastic tunneling 
features appear in arcs of constant V. Next, on 
the switching side (/Ry > 0), both V and dV/dI 
jump at critical current Jc, which depends on 
T. The detailed behavior of J.(7)Rx depends 
on 0, as we detail below. Finally, we find the 
hysteresis of JJs to be reduced in the high- 
temperature regime as J((T)Ry decreases. 
The analysis of JcRy for 26 devices with 0 
between 0° and 180° indicates that the mag- 
nitude of J-Ry becomes smaller closer to 45° 
and 135°, where the JJs also appear less hys- 
teretic. In Fig. 2B, we plot J-Ry at two rep- 
resentative temperatures, 12 and 30 K, as a 
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Fig. 2. d-wave SOP symmetry revealed by supercurrent tunneling. 

(A) Normalized differential resistance [dV/dl]/Ry versus characteristic voltage /Ry 
and temperature T. Current is swept to the right. Blue arrows highlight IR. 

(B) Angular dependence of IcRy at 30 and 12 K. The points follow the |cos (26) | 
curve predicted for nearly incoherent tunneling between d-wave superconductors 

(15). (Inset) Schematic diagram of the Fermi surface of both crystals, with sign and 


function of a new variable 6 = 6 mod x/2. We 
observe that IcRn (6) follows |cos (20) , which 
is expected for nearly incoherent Cooper pair 
tunneling between d-wave superconductors 
(5). Similar angular dependence is seen in 
junction critical current density and junc- 
tion voltage just above the critical current V 
Ue) (see fig. $8). In conventional tunneling 
JJs, eVUIc) = 2A (42). In BSCCO IJJs, eV) 
usually reaches about half the spectroscopic 
gap value (J6). 

The temperature dependence of Josephson 
coupling in our twisted junctions provides 
further insight into the pairing symmetry of 
the Cooper pairs in BSCCO. Figure 2C shows 
I.(T)Ry for several representative JJs with dif- 
ferent 6. For 6 ~ 0, we find that J((7)Ry mono- 
tonically decreases as T increases, approximately 
following the theory curve for nearly incoher- 
ent tunneling between d-wave superconduc- 
tors [dashed line; see section $8 of (34)]. As 6 
increases, however, a surprising nonmonotonic 
behavior of J((7)Ry appears. For example, for 
6 = 29° and 39° in Fig. 2C, Io(T)Rn increases 
alongside 7, reaching a maximum value at T = 
Ty, and then decreases as T approaches To. 
More quantitative analysis can be found in Fig. 
2, D and E, where we plot the low-temperature 
slope dUcRy)/dT (dashed lines in Fig. 2C and fig. 
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$12) and observed Ty, respectively. These plots 
show a nonmonotonic behavior of JcRy(T), sig- 
naled by the positive slope of J-Rx(T) at low 
temperatures with finite 7),, appearing within 
|6 — n/4| < n/8. 

magnitude of superconducting gap $\Delta(\mathbf{k})$ superposed in color. At $\theta > 10^\circ$, Fermi surfaces
intersect at two points per quadrant (circles) with different relative signs of SOPs.
(C) Temperature dependence of the critical current for select devices. Dashed lines
are linear fits to the low-temperature data. $T_M$ is the temperature where $I_c$ is maximal.
Gray theory line shows expected $I_c R_N(T)$ behavior [see section S8 of (34)]. (D) The slope
of the low-temperature linear fit, $d(I_c R_N)/dT$. (E) $T_M$ as a function of angle $\theta$.

The strong $\theta$ dependence of the nonmonotonic $I_c(T)R_N$ in Fig. 2, D and E, points to SOP
d-wave symmetry as its origin. For this explanation, we consider a gap function
$\Delta_{1,2}(\mathbf{k})$ superimposed on top of the Fermi surface $E_F^{1,2}(\mathbf{k})$ (43), where
$\mathbf{k}$ is the in-plane Cooper pair wave vector in the first Brillouin zone and the index 1 or 2
denotes the top and bottom layer of BSCCO, respectively. In the twisted JJ, $E_F^{1}(\mathbf{k})$ and
$E_F^{2}(\mathbf{k})$ are rotated relative to each other by angle $\theta$ (Fig. 2B, inset). At
$\theta \approx 0$, $E_F^{1}(\mathbf{k})$ and $E_F^{2}(\mathbf{k})$ overlap almost completely and
$\Delta_1(\mathbf{k})\Delta_2(\mathbf{k}) > 0$, yielding a uniformly positive contribution to critical
current for coherent tunneling (75). As $\theta$ increases to ~10°, the Fermi surfaces overlap at two
points per quadrant in k-space, but with opposite phase difference between layers, yielding nodal
and antinodal contributions where $\Delta_1(\mathbf{k})\Delta_2(\mathbf{k})$ alternates in sign. Because
the supercurrents from these two components carry opposite signs, their contributions to the total
critical current compete. As the gap in the nodal region is much smaller than the antinodal one,
nonmonotonic temperature-dependent $I_c(T)$ is expected for $\theta \approx \pi/8$, where the
competition is strongest [see section S8 of (34) and (44) for more quantitative discussion]. We also
note that near $\theta \approx \pi/4$, the JJ coupling is strongly suppressed but remains nonzero. For
the $\theta = (44.9 \pm 0.1)^\circ$ junction, Josephson critical current can be measured up to 79 K,
with $I_c R_N$ about two orders of magnitude smaller than the 0° value.
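The suppression of the first-order coupling near $\theta = \pi/4$ can be illustrated with a deliberately simplified toy calculation. This is not the Fermi-surface overlap model discussed above: it assumes a normalized d-wave gap of the form $\cos 2\alpha$ and simply averages the product of the two layers' gaps over the in-plane angle $\alpha$. The function names are hypothetical.

```python
import numpy as np

def dwave_gap(alpha):
    """Normalized d-wave gap versus in-plane angle alpha (radians)."""
    return np.cos(2.0 * alpha)

def mean_gap_product(theta_deg, n=3600):
    """Angular average of gap(alpha) * gap(alpha - theta) for twist angle theta."""
    theta = np.deg2rad(theta_deg)
    alpha = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    return float(np.mean(dwave_gap(alpha) * dwave_gap(alpha - theta)))

# The average equals cos(2*theta)/2, so it changes sign at 45 degrees.
for theta_deg in (0.0, 22.5, 44.9, 45.0):
    print(theta_deg, mean_gap_product(theta_deg))
```

In this toy picture the averaged product follows $\cos(2\theta)/2$, so the lowest-order coupling vanishes at exactly 45° while remaining small but finite at 44.9°.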


Second-order Josephson coupling near $\theta = 45^\circ$


emerge close to $\theta = 45^\circ$. (A) Response to in-plane magnetic field $B_\parallel$ at $\theta = 44.9^\circ$.
One well-developed Fraunhofer pattern corresponding to $d = 28$ nm (14 nm for
a second-harmonic CPR) appears, as indicated by the gray dashed line. (Left inset)
Junction schematic. The Meissner currents (white lines) in the flakes affect the
phase difference at the twist junction, enhancing its effective thickness $d$ for the
magnetic flux (42) [see section S13 of (34)]. (Right inset) IVCs at fields highlighted
with arrows in the main panel. (B) $\theta = 46.3^\circ$ JJ. Two Fraunhofer patterns with
different magnetic field periodicity appear (dashed lines), with an estimated
$d = 15$ nm (gray) and $d = 1.5$ nm (red). (Inset) $I$-$V$ characteristics at two different

across the junction and microwave illumination power $P^{1/2}$ at 70 K. $dV/dI$ dips (white,
pink) correspond to Shapiro steps. (Inset) $I$-$V$ characteristic with half-integer Shapiro
steps. (D) Fourier transform of $dV/dI(V)$ and (E) the spectral power at $\nu = 1$
and $2 \cdot 2e/hf$ for the $\theta = 44.6^\circ$ JJ. (Inset) Schematic of the junction free energy
$F$ versus Josephson phase as twist angle changes. At 45°, the second harmonic
dominates the current-phase relation. (F, H, and I) As in (C) to (E), but for the
43.7° device, where only integer Shapiro steps appear. (G) Representative $dV/dI$ for

The origin of the finite supercurrent near 45° is encoded in the Josephson current-phase
relation (CPR) $j(\varphi) = j_1 \sin(\varphi) + j_2 \sin(2\varphi)$, where $j$ and $\varphi$ are the
Josephson current density and phase, respectively (10, 12, 24). At $\theta = 45^\circ$, the
maximally mismatched SOP phase eliminates the conventional direct Cooper pair tunneling term
$j_1$. The supercurrent must then flow through second-order mechanisms, which are predicted to
support an interfacial superconductivity with doubly degenerate Josephson energy owing to the
inherent $d_{x^2-y^2}$ symmetry of the SOP within each flake (10-12). The CPR corresponding to
this unusual SOP leaves an experimental signature in the in-plane magnetic interference
(“Fraunhofer”) pattern and microwave-induced Shapiro steps in the IVC, which are both sensitive
to the presence of two degenerate local minima in Josephson energy that are connected by TRS
(10, 11, 24).
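As a worked step for the two-harmonic CPR quoted in this section, integrating $j(\varphi)$ over the phase gives the Josephson free energy per unit area. This is a standard relation rather than a result taken from the article. The additive constant is dropped, and the convention $j = (2e/\hbar)\,\partial F/\partial\varphi$ is an assumption made here.

```latex
\begin{aligned}
F(\varphi) &= \frac{\hbar}{2e}\int j(\varphi)\,d\varphi
            = -\frac{\hbar}{2e}\left[\, j_1\cos\varphi + \frac{j_2}{2}\cos 2\varphi \,\right],\\[4pt]
F(\varphi)\Big|_{j_1 = 0} &= -\frac{\hbar\, j_2}{4e}\cos 2\varphi .
\end{aligned}
```

With $j_1 = 0$ the energy is $\pi$-periodic, so every $2\pi$ interval of the phase contains two equivalent minima: at $\varphi = 0$ and $\pi$ for $j_2 > 0$, and at $\varphi = \pm\pi/2$ for $j_2 < 0$. In the latter case the two minima are exchanged by $\varphi \to -\varphi$, which is how time reversal acts on the phase.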

Figure 3, A and B, shows Fraunhofer interference patterns (FIPs) obtained at two different
angles by applying parallel magnetic field $B_\parallel$. Figure 3A shows the $\theta = 44.9^\circ$ JJ
with an $I_c(B_\parallel)$ oscillation with a period about 20 times shorter than that expected for
intrinsic junctions. At a nearby angle $\theta = 46.3^\circ$ (Fig. 3B), the short-period oscillations
appear to coexist with a long-period oscillation characteristic of IJJs (41). A reduction in the
FIP period implies an increase in $d$, the thickness of the interfacial region where magnetic
field penetrates owing to field-induced currents extending into the crystal bulk (42). The ratio
$d/t$, where $t$ is the total junction thickness, depends only on the properties of the crystal
and junction geometry and should not depend strongly on
the twist angle [see section S13 of (34)]. Notably, for the devices in Fig. 3, A and B, which
share similar geometry, we obtain $d/t \sim 0.1$ for both devices only if we assume that the 44.9°
junction is coupled purely through the second-order process with a doubled FIP period.
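The relation between the interference period and the effective thickness $d$ can be sketched numerically. The sketch assumes the textbook picture in which the critical current oscillates once per flux quantum threading the area $d \times w$ for a first-harmonic CPR, and once per half flux quantum for a second-harmonic CPR. The junction width and field period below are hypothetical values, chosen only so that the first-harmonic reading lands near the 28 nm quoted in the Fig. 3A caption.

```python
H_PLANCK = 6.62607015e-34             # J s (exact SI value)
E_CHARGE = 1.602176634e-19            # C (exact SI value)
PHI_0 = H_PLANCK / (2.0 * E_CHARGE)   # superconducting flux quantum, Wb

def effective_thickness(period_tesla, width_m, harmonic=1):
    """Thickness d inferred from the field period of the interference pattern.

    Assumes one oscillation per flux quantum through the area d * width for
    the first harmonic, and per half flux quantum for the second harmonic.
    """
    return PHI_0 / (harmonic * period_tesla * width_m)

width = 10e-6     # hypothetical junction width, m
period = 7.4e-3   # hypothetical field period, T

d_first = effective_thickness(period, width, harmonic=1)
d_second = effective_thickness(period, width, harmonic=2)
print(f"first harmonic:  d = {d_first * 1e9:.1f} nm")
print(f"second harmonic: d = {d_second * 1e9:.1f} nm")
```

For a fixed measured period, attributing the pattern to the second harmonic halves the inferred $d$, which is the factor of two separating the 28 nm and 14 nm values in the caption.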

all four devices with Shapiro step measurements at different twist angles. Half-integer
Shapiro steps are only observed in junctions closest to 45°.

The presence of higher-order harmonics in the CPR near 45° is further revealed by measuring
Shapiro steps in $I$-$V$ under microwave illumination of frequency $f$ (Fig. 3, C and F, inset)
(11, 24). In conventional JJs, where the CPR is dominated by the first harmonic of $\varphi$,
Shapiro steps appear as plateaus of constant voltage whenever $V$ approaches $n \cdot hf/2e$,
where $n$ is an integer. We observe these conventional integer Shapiro steps in the JJs
substantially away from $\theta = \pi/4$, as shown in the $\theta = 43.7^\circ$ device (Fig. 3F).
Consistent with the FIP discussed above, the experimentally observed Shapiro steps also show
signatures of the second-harmonic CPR as $\theta$ approaches $\pi/4$ (Fig. 3C): specifically, when
our devices are within $(45 \pm 1)^\circ$, additional steps at half-integer $n$ appear. As shown in
Fig. 3C, microwave illumination causes a series of $dV/dI$ dips corresponding to steps in $I$-$V$
at both integer and half-integer $n$ to emerge from the symmetric $I$-$V$ curve of an overdamped JJ.
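The step voltages follow directly from $V = n \cdot hf/2e$. A minimal sketch, assuming a hypothetical drive frequency of 10 GHz (the text does not state $f$ here) and treating the second-harmonic steps as the same ladder with half the spacing:

```python
H_PLANCK = 6.62607015e-34   # J s (exact SI value)
E_CHARGE = 1.602176634e-19  # C (exact SI value)

def shapiro_voltages(freq_hz, n_max, harmonic=1):
    """Step voltages n * h * f / (2e * harmonic) for n = 1..n_max, in volts."""
    step = H_PLANCK * freq_hz / (2.0 * E_CHARGE * harmonic)
    return [n * step for n in range(1, n_max + 1)]

freq = 10e9  # hypothetical drive frequency, Hz
integer_steps = shapiro_voltages(freq, 3, harmonic=1)
half_steps = shapiro_voltages(freq, 6, harmonic=2)

# Label each second-harmonic step by whether it coincides with an integer step.
for v in half_steps:
    is_integer = any(abs(v - u) < 1e-12 for u in integer_steps)
    print(f"{v * 1e6:7.2f} uV  {'integer' if is_integer else 'half-integer'}")
```

Every other step of the second-harmonic ladder coincides with a conventional integer step, and the steps in between are the half-integer ones.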

The Fourier components of $dV/dI(V)$ show the relative strengths of integer and half-integer
Shapiro steps. For the conventional Shapiro steps appearing in the 43.7° device, the Fourier
transform shows dominant spectral power for the first harmonic $\nu_1 = 2e/hf$ (Fig. 3, H and I).
In contrast, for the devices exhibiting half-integer Shapiro steps (e.g., the 44.6° device at
low microwave power), the Fourier transform is dominated by the second harmonic $\nu_2 = 4e/hf$.
The corresponding $dV/dI$ shows dips of similar strength at half-integer and integer steps,
indicating that second-order processes dominate over the conventional Josephson coupling
close to 45°.
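The Fourier analysis described here can be sketched on synthetic data. The trace below is not measured data: it is a hypothetical $dV/dI(V)$ with weak dips at the integer-step period and stronger dips at half that period, with the voltage axis expressed in units of $hf/2e$ so that $\nu_1$ and $\nu_2$ fall at frequencies 1 and 2.

```python
import numpy as np

def harmonic_power(signal, spacing):
    """Return (frequencies, power) of a real signal with its mean removed."""
    spectrum = np.fft.rfft(signal - np.mean(signal))
    freqs = np.fft.rfftfreq(len(signal), d=spacing)
    return freqs, np.abs(spectrum) ** 2

# Voltage axis in units of hf/2e, so integer steps sit at v = 1, 2, 3, ...
n_points, v_span = 2000, 20.0
v = np.linspace(0.0, v_span, n_points, endpoint=False)

# Synthetic dV/dI: weak dips with period 1 plus strong dips with period 1/2.
dvdi = 1.0 - 0.1 * np.cos(2 * np.pi * v) - 0.4 * np.cos(4 * np.pi * v)

freqs, power = harmonic_power(dvdi, v_span / n_points)
p1 = power[np.argmin(np.abs(freqs - 1.0))]  # weight at nu_1 = 2e/hf
p2 = power[np.argmin(np.abs(freqs - 2.0))]  # weight at nu_2 = 4e/hf
print("dominant harmonic:", 2 if p2 > p1 else 1)
```

Comparing the spectral weight at the two frequencies gives a single number for how strongly the half-integer steps contribute, which is the comparison the text makes between the 43.7° and 44.6° devices.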

Similar second-harmonic Josephson effects have been observed in parallel-connected arrays of
0- and $\pi$-JJs, where multiple degenerate ground states can form (45-48). Electronically,
Josephson currents across distributions of facets in such junctions create disordered magnetic
interference patterns, often with prominent peaks symmetrically centered about zero magnetic
field (49-52). This is at odds with our Fraunhofer-like $I_c(B_\parallel)$ measurements. Moreover,
our TEM analysis [see section S2 of (34)] quantitatively bounds our interfacial twist angle
disorder to 0.2°, which excludes the scenario of angle inhomogeneity-induced TRS breaking
in this device (53).


Current-trainable Josephson diode effect 


Fig. 4. Time-reversal symmetry breaking Josephson diode effects.
(A) Junction $I$-$V$ characteristics obtained from a JJ with $\theta = 39.5^\circ$ at 12 K,
revealing four distinct $J_c$ accessible via different current sweep directions and
histories. Schematics show RCSJ phase particle on the washboard potential just
before reaching $J_c$. (B and C) Junction current bias (dashes) and voltage
response (lines) across training (gray shading) and test pulses. For each
subpanel, identical training pulses place the RCSJ phase particle in the same
state (schematics). Insets show IVC of each test pulse. (D) Junction response
to consecutive training and test pulses, showing controllable behavior.

The sign of $j_2$ in CPR plays an important role in the quantum properties of twist Josephson
junctions when it exceeds $j_1$ in magnitude. When $j_2$ is negative, the JJ ground state splits
into a pair of degenerate states with complex phase angle $\varphi = \pm\varphi_0$ lying between 0
and $\pi$ (12, 24) (Fig. 3E, inset). When $\theta$ is tuned slightly away from 45°, both CPR
harmonics coexist, creating asymmetric potential barrier heights around each of the degenerate
energetic minima. We observe such TRS breaking through an asymmetric critical current, when the
Josephson phase particle begins to roll down the resistively and capacitively shunted Josephson
(RCSJ) washboard potential at different tilt angles, corresponding to different bias currents, in
each direction (Fig. 4A, schematic) (54).
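The direction-dependent release of the phase particle can be sketched for the two-harmonic CPR $j(\varphi) = j_1\sin\varphi + j_2\sin 2\varphi$. This is a static toy calculation, not the authors' analysis: it assumes zero temperature and an underdamped junction, so the bias at which the particle leaves its well is taken as the switching current. The amplitudes are hypothetical and in arbitrary units.

```python
import numpy as np

def cpr(phi, j1, j2):
    """Two-harmonic current-phase relation j1*sin(phi) + j2*sin(2*phi)."""
    return j1 * np.sin(phi) + j2 * np.sin(2.0 * phi)

def well_phase(j1, j2):
    """Positive-phase energy minimum phi_0; assumes j2 < 0 and |j2| > j1/2."""
    return float(np.arccos(-j1 / (2.0 * j2)))

def switching_currents(j1, j2, well_sign=+1, n=200001):
    """Bias that releases the particle from the well at well_sign * phi_0,
    returned as (positive direction, negative direction) magnitudes."""
    phi_start = well_sign * well_phase(j1, j2)

    def release_bias(direction):
        # Follow the static branch j(phi) away from the minimum until it
        # stops growing; that peak is the largest bias the well can hold.
        phis = phi_start + direction * np.linspace(0.0, 2.0 * np.pi, n)
        branch = direction * cpr(phis, j1, j2)
        peak = int(np.argmax(np.diff(branch) <= 0.0))
        return float(branch[peak])

    return release_bias(+1), release_bias(-1)

j1, j2 = 0.5, -1.0   # hypothetical amplitudes, arbitrary units
print("+phi_0 well:", switching_currents(j1, j2, +1))
print("-phi_0 well:", switching_currents(j1, j2, -1))
print("pure second harmonic:", switching_currents(0.0, j2, +1))
```

With both harmonics present, each well has unequal release currents in the two directions, and the two wells are mirror images of each other. With $j_1 = 0$ the two directions become equal.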

We find that twist JJs slightly away from $\theta = 45^\circ$ exhibit nonreciprocal Josephson
critical current, which can be controlled by the current sweep sequence. In this scheme, one
can controllably prepare the TRS broken initial $\pm\varphi_0$ states by the “training current
sequence,” as shown in Fig. 4A. We use both positive and negative direction (+/−) and full and
half sweep (F/H) to control the location of the phase particle during the retrapping process
with high probability (24), where the switching critical current density $J_{c\pm}^{F/H}$ is
defined from the supercurrent to normal current transition. We note
that $J_{c+}^{F/H} \approx |J_{c-}^{F/H}|$, reflecting the symmetry
between $\pm\varphi_0$ states due to TRS in the absence
of magnetic field.

(E) Temperature-dependent switching current distributions of the four different
critical currents. (F) Difference between mean full and half sweep critical currents,
$\Delta J$, versus twist angle $\theta$, and the standard deviation measured at 30 K. Schematics
show the shape of the potential with respect to twist angle, with the phase
particle in the same well before switching. (G) Difference between positive and
negative critical currents, $\Delta J_\pm$, normalized by their average $J_{\mathrm{avg}}$,
measured at 30 K. Schematic shows corresponding potential shape near 45°,
where the phase particle is in different wells before switching.

Controlled placement of Josephson phase particles in a specific $\pm\varphi_0$ valley (broken TRS)
and the directionally asymmetric barrier associated with each valley (broken spatial inversion
symmetry) are the critical ingredients for a Josephson diode (55-64), as shown in Fig. 4,
B and C. Here, by biasing the “training” current into the voltage state in the forward (reverse)
direction with the amplitude between [...] and then back to zero, the running JJ phase particle
is retrapped into the $-\varphi_0$ (or $+\varphi_0$) valley, respectively. A subsequent test sweep
measures a critical current whenever the RCSJ washboard potential is tilted sufficiently to
overcome the potential barrier, giving rise to asymmetric critical current densities
$|J_{c+}|$ and $|J_{c-}|$. Note that we can flip the polarity of this Josephson diode by flipping
polarity of the training current (Fig. 4C). In this way, we repeatedly and controllably prepare
the JJ into either ground state (Fig. 4D). At higher temperatures, the phase dynamics become
damped, which hinders the processes depicted in Fig. 4A by trapping $\varphi$ in the local minima
of the washboard potential. As a result, the $J_c^{F/H}$ asymmetry becomes less prominent upon
heating and vanishes at about 50 K (Fig. 4E).

In this study, we investigated the Josephson diode effect in several JJs with different twist
angles. For each JJ, the critical current difference $\Delta J$ between full and half sweeps
reflects the asymmetry in potential barrier for each $\pm\varphi_0$ valley (Fig. 4F), whereas the
difference $\Delta J_\pm$ between positive and negative critical currents reflects asymmetries
between the two free energy minima (Fig. 4G). In both cases, the differences are strongest near
43° and weaken at both 45° and ~35°, where the $j_1$ and $j_2$ terms become negligible,
respectively. Away from 45°, the absence of the diode effect is caused by the restoration of
time-reversal symmetry. At exactly 45°, on the other hand, the purely second-harmonic Josephson
potential has an additional symmetry around $\pm\varphi_0$:
$\pm\varphi_0 + \varphi \leftrightarrow \pm\varphi_0 - \varphi$, which forbids the diode effect.
Therefore, the diode effect is expected to vanish at 45° and at sufficiently low twist angles,
which is consistent with our results. Thus, the observed diode effect is closely related to the
free energy landscape of coexisting first- and second-harmonic CPR terms. The asymmetry between
the two ground state valleys as observed in Fig. 4G can arise from an additional cosine term in
the CPR [see section S14 of (34) for details]. Our demonstration of the polarity-tunable
Josephson diode effect in near-45° twist BSCCO JJs thus provides strong experimental evidence
for TRS broken superconductivity based on the emergent interfacial SOP.
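The claim that the diode effect vanishes both for a purely second-harmonic CPR and once the first harmonic dominates can be checked in a toy model that scans the ratio $j_1/|j_2|$. This is only a sketch under stated assumptions: a static, zero-temperature picture with $j_2 < 0$, and no attempt to map the ratio onto an actual twist angle. The function name and the chosen ratios are hypothetical.

```python
import numpy as np

def diode_asymmetry(j1, j2, n=200001):
    """(J+ - J-) / mean(J+, J-) for the well at phi_0 >= 0 of the CPR
    j1*sin(phi) + j2*sin(2*phi), with j2 < 0 (toy model, arbitrary units)."""
    # Energy minimum; it collapses onto phi = 0 once j1 >= 2*|j2|.
    phi0 = np.arccos(np.clip(-j1 / (2.0 * j2), -1.0, 1.0))
    x = np.linspace(0.0, 2.0 * np.pi, n)
    release = []
    for direction in (+1, -1):
        phis = phi0 + direction * x
        branch = direction * (j1 * np.sin(phis) + j2 * np.sin(2.0 * phis))
        # The first peak of the static branch is the bias that frees the particle.
        release.append(branch[np.argmax(np.diff(branch) <= 0.0)])
    j_plus, j_minus = release
    return float((j_plus - j_minus) / (0.5 * (j_plus + j_minus)))

for ratio in (0.0, 0.25, 0.5, 1.0, 1.5, 3.0):
    print(f"j1/|j2| = {ratio:4.2f}  asymmetry = {diode_asymmetry(ratio, -1.0):+.3f}")
```

In this model the asymmetry is zero at $j_1 = 0$, where each well is mirror symmetric. It is finite when both harmonics coexist, and zero again once the two wells merge into a single minimum at $\varphi = 0$.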
Facing fraud in my lab


On the Monday when C.B. came into my office, his worried face told me the news was bad. Three
weeks earlier, two of my postdocs had appeared at my door, barely holding back tears. They 
were certain that another postdoc colleague had fabricated his work because they had never 
seen him—let’s call him John—use the equipment necessary for the latest data he had pre- 
sented in our lab meeting. Initially, I was certain they were mistaken. I had already presented 
John’s data at several national conferences, and we were writing a paper I thought would have 
a huge impact on our field. But I could not ignore their suspicions. I phoned the university integrity 
officer and met with my dean. I also hired C.B., a former senior scientist in my lab, to repeat the most
critical experiments. Now, the fraud was confirmed.


I was devastated. What should I do? 
How could I face my colleagues and 
the people working in my lab? 

It wasn’t easy on any of us. The 
two postdocs who had identified
the fraud had spent countless hours 
in the lab with John and felt guilty 
about informing on him. They wor- 
ried they might be wrong, that they 
might be needlessly ruining the life 
of someone they liked and respected. 
They were distraught when he sent 
them sorrowful emails, and finally 
terrified when he left angry messages 
on their home answering machines. 
We had to call the campus police and 
change the locks on our lab doors. 
John eventually announced he would
be leaving with his wife, who had 
found a job in another state. 

In the weeks that followed the discovery, university inves- 
tigators rifled through lab notebooks and sorted through 
freezer boxes full of samples. Machines, printouts, and com- 
puters were carted away to be forensically examined. Lab 
members gave recorded testimony, and I met with a faculty 
board of enquiry. My dean, though sympathetic, probed 
John’s possible motivation as well as why I had failed to 
prevent the fraud. Although everyone strained to remain 
polite, it was clear they faulted me, too, for accepting John’s 
data so readily. As a senior scientist, I felt humiliated, but 
I also felt I deserved the humiliation. Why had I not been 
more vigilant? 

I was forced to confront my own complicity. I understood 
perfectly why I had been duped. Although I was well-known 
in my field, had presented keynote lectures at major confer- 
ences and mentored successful junior scientists, I longed to 
be known for making a true breakthrough, for discovering 
something amazing. Thus, after seeing John’s exciting initial 


“I felt humiliated. ... Why had I
not been more vigilant?”


results, I stopped reviewing his origi- 
nal data. I wanted to believe the find- 
ings underpinning the novel concepts 
were real, and I was satisfied with the 
beautiful graphs he produced, which 
I imagined would bring us recogni- 
tion as conceptual innovators. 

I was profoundly disappointed 
in myself. I had barely escaped the 
shame of having to retract a published 
article, and the investigation and the 
monthslong impact on my working 
life were wrenching. For my lab, the 
aftermath was so disruptive that sev- 
eral people sought counseling. 

I retired several years after this inci- 
dent. With time to consider what had 
occurred, I began to wonder how fre- 
quently other lab directors are forced 
to confront their own errors and cul- 
pability. At the time, I felt more personally guilty than I cared 
to admit, and I concealed the fraud from my colleagues. I 
now think that was the wrong approach; others could surely 
have learned from the mistakes I made. 

John’s fraud left me feeling personally violated, as if a thief 
had come to my home and rifled through my drawers and 
closets. Scientific fraud is, indeed, a kind of theft. Apart from 
the wasted time and ill-spent money, it is a blow to one’s repu- 
tation, and it damages a lab’s credibility. 

The worst of it was that my deepest beliefs had been 
shaken. I had depended on the idea that scientific re- 
search is a sacred trust, not only to the other individu- 
als in one’s group, but also to the research enterprise as 
a whole. John’s betrayal was a warning lesson—one I am 
sorry I had to learn.


Rosalind A. Coleman is a professor emeritus at the University of North 
Carolina at Chapel Hill. Send your career story to SciCareerEditor@aaas.org. 